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The Theoretical Efficiency of A Test 


By FRANKLIN HENRY 
University of California 
‘ Berkeley, California 


(Submitted for publication, December, 1946) 


is commonly believed that the effectiveness of a test is proportional 
[- the validity coefficient r (or some index derived from r) relating 
test scores and criterion scores. Taylor and Russell** have objected to 
this concept and have in fact shown that a test with a validity coefficient 
of r = .50 will in some circumstances be more effective than a test with 
a validity of approximately r = 1.00. Johnson® has pointed out that in 
the case of a 2x2 contingency table analysis, it may be more satisfactory 
w change the critical passing score than to attempt to increase the 
validity of a test. 
Taylor and Russell? have proposed the use of selection ratio tables, 
which make it possible to predict the proportion of success and failure 


in a test-selected group of individuals if the following information is 
available: 


1. The proportion of unselected individuals who are successful in 
the criterion situation. 

2. The coefficient of correlation relating unselected test scores to 
criterion success and failure. 

3. The proportion of individuals who are retained in the selected 
group. 


This method of evaluating test efficiency proved quite valuable 
in the Army Air Forces’ wartime psychological research on aircrew selec- 
tion. An extension of the method was also found useful in estimating 
the value of tests of fitness for flying at extremely high altitudes.1 In 
the latter case, insight into testing theory gained by the application of 
this method made it possible to reconcile differences in experimental 


results that appeared quite conflicting when analyzed by conventional 
methods. 4 


It is the purpose of the present study to show that if the selection- 
ratio method is derived on a broader basis than was considered by Taylor 
and Russell, it is possible to investigate phases of the problem that were 
outside the scope of their study. Two aspects will be of particular inter- 
ect, namely: 





From the Department of Physical Education and the Division of Medical Physics. The 
work described in this report was supported, in part, by a contract recommended by the 
Committee on Medical Research, between the Office of Scientific Research and Develop- 
ment, and the University of California. 

*Superior figures refer to numbered bibliography at end of article. 
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1. Delineation of the relationships between degree of selection and 
degree of criterion success for specified validity coefficients, leading to 
a rational basis for determining the best placement of “cutting scores” 
for selection or rejection of tested individuals, and for deciding whether 
it is worthwhile to use a certain test in some particular situation. 

2. Determination of the effectiveness of a test in eliminating poten- 
tially successful individuals from the rejected group. This aspect is 
extremely important in some situations, as for example when individuals 
are excluded from professional training on the basis of test battery 
scores, or are advised by a vocational counselor to avoid certain trades 
or professions. 

It should be pointed out that there are situations in which the 
results of the present study do not apply. In research applications, for 
example, interest may be centered in the relation between test scores and 
criterion scores. However, these applications are not directly concerned 
with selection. But there are many other situations in which success or 
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Fic. 1. Scatter diagram relating sigma scores of 100 students taking a one hour 
test to sigma scores on a criterion consisting of a three-hour comprehensive examina- 
tion. The validity coefficient is r = .80. 
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fwilure must necessarily be defined in terms of pass or fail. An individual 
does or does not graduate from school; he does or does not win a race; 
he does or does not get or lose a job, have an accident, or become 
married or divorced. 


SOME FUNDAMENTAL RELATIONSHIPS 

In Figure 1 is shown a scatter diagram relating the scores made by 
100 students who took a one-hour examination and a three-hour com- 
prehensive final examination in a course in physiological hygiene. The 
correlation between the test and final was r = .80. Assuming for pur- 
poses of exposition that the final grade in the course was determined 
only by the comprehensive examination, the fifteen percent of the stu- 
dents who received less than grade C failed to do satisfactory work. It 
is of interest to calculate the effectiveness of the short test in selecting 
the satisfactory students. Suppose the poorest 15 percent in terms of the 
short test scores are discarded (failed). This procedure will eliminate 
all individual points to the left of the vertical 85 percent line. Now 
there were 85 successes in the unselected group. By simply counting 
the points in the figure it is apparent that five of these (ie., the points 
to the left of the 85 percent line but above the horizontal line P-P’) will 
be discarded; these are test “misses.” The proportion of success in the 
retained (selected) group will therefore be 85 minus 80, divided by the 
tctal 85 who were retained. The result is 0.94, or 94 percent success 
in the selected group. The results with other degrees of selection appear 
in Table I. It is necessary to select the best 60 percent (ie., reject 40 
percent) in order to eliminate all the individuals who will fail in the 
criterion situation. But rejecting only 10 percent achieves considerable 
purification of the selected group; this yields 92 percent success com- 
pared with only 85 percent without any selection. 


TABLE I 


PERCENT SUCCESS IN SELECTED INDIVIDUALS IN A CRITERION SITUATION INVOLV- 
ING A HIGH PROPORTION OF SUCCESS IN THE UNSELECTED GROUP (85 PERCENT) 
AND A LOW PROPORTION (15 PERCENT).* 


Pro ion Success in Success in 
Selected Calculation SelectedGroup Calculation Selected Group 

A he tl aa We eee. 2 oe 15.0% 
90 (85-2)/90 92.2 (15-0)/90 16.7 
80 (85-8) /80 96.2 (15-0)/80 18.8 
70 (85-17)/70 97.1 (15-0)/70 21.4 
60 (85-25)/60 100.0 (15-0)/60 25.0 
50 (85-35)/50 100.0 (15-0)/50 30.0 
40: (85-45)/40 100.0 (15-0)/40 37.5 
30 (85-55)/30 100.0 (15-1)/30 46.7 
20 (85-65)/20 100.0 (15-3)/20 60.0 
10 (85-75)/10 100.0 (15-7)/10 80.0 


*The validity of the test is r = .80. Data from 100 individuals plotted in Figure 1. 

















THE THEORETICAL EFFICIENCY OF A TEST 93 


In the example just given, there was a relatively high degree of 
criterion success (85 percent) in the unselected group. Consider now 
the situation if criterion success is low. The same data may be used to 
examine this situation if it is considered that only students who made 
grade A in the final examination are truly successful. Only students 
above the line Q-Q’ in Figure 1 are successful by this standard. If 15 
percent of the unselected students are discarded and 85 percent selected 
by the short test, the proportion will be 15 minus zero divided by 85, 
or 0.176, ie., 17.6 percent success as compared with 15 percent in the 
unselected group. In this circumstance, the test is without value. As 
shown in the right-hand portion of Table I, high success in the selected 
group will be achieved only if a very high proportion of the students are 
discarded. The results may be summarized as follows: The short test 
having a validity of r = .80 is quite effective for selecting students who 
are “C” or better, but is practically worthless for selecting “A” students. 
Furthermore, it is wasteful to discard too many students for “C” selec- 
tion, but it is essential to discard a large proportion if good results are to 
be secured when testing for “A” students. 

Another possible use of the short test might be comparable to the 
ccunseling problem mentioned earlier, using the test as a basis for advis- 
ing students to drop the course. In the first case, grade C or better 
represents success as before. If fifteen percent are discarded on the basis 
of the test, there will be 10 criterion failures (i.e., those who are below 
the line P-P’ and to the left of the vertical 85 percent line). Ten failures 
in fifteen discards is 67 percent, which does not seem to be a very high 
proportion of failures when it is realized that 33 percent of these dis- 
carded individuals would actually be successful, if allowed to continue 
the course. In the case of discarding 40 percent, which served to purify 
completely the “accepted” group, the results are very bad indeed from 





TABLE Il 
PERCENT FAILURES IN REJECTED INDIVIDUALS IN THE CRITERION SITUATIONS 
or TABLE I. 

Proportion Failure in Failure 
Selected Calculation Rejected Group Calculation Rejected Group 
HWOQ% i ae ee 
90 8/10 80.0% 10/10 100.0% 
80 12/20 60.0 20/20 * 100.0 
70 13/30 43.3 30/30 100.0 
60 15/40 37.5 40/40 100.0 
50 15/50 30.0 50/50 100.0 
40 15/60 25.0 60/60 100.0 
30 15/70 21.4 69/70 98.6 
20 15/80 - 18.8 77/80 96.2 
10 15/90 16.7 83/90 92.2 


0 15/100 15.0 85/100 85.0 
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the counseling point of view, for here there will be only 38 percent 
failures in the rejected individuals, and 62 percent would in fact be 
successful in the course if allowed to continue. 

This result has occurred because, in the situation that involves a 
reasonably high proportion of unselected success in the criterion, the 
“test misses” fall into the rejected group, whereas if the proportion of 
unselected criterion success is relatively low, the “misses” fall into the 
retained or selected group. This accounts for the poor results in attempt- 
ing to select the “A” students by the short test. The conclusion may 


- be confirmed by inspection of Table II, which shows that students 


rejected by the test can be advised with some confidence that they will 
fail to secure an “A” grade. Summarizing the implications, it may be 
stated that when a test is used for counseling purposes, individuals 
advised to avoid a profession or trade on the basis of poor aptitude test 
scores will be poorly advised if the criterion situation involves a rela- 
tively high degree of success, because in this circumstance there will be 
a large proportion of “test misses” in this rejected group, even with 
tests of high validity. However, if the criterion situation involves a low 
proportion of success among unselected individuals, ‘test misses” will 
be rare and such counseling can be justified. 


SELECTION-EFFICIENCY CURVES 


Five members of the physical education faculty were asked to rate 
major students as to their professional competence. A five-point rating 
scale was used. Forty-eight men were rated by all five of the judges. 
These data will serve to illustrate the application of selection-efficiency 
curves. 


The ratings of a single one of the judges will be used as a selection 
test. There will be three criteria: 


(A) Agreement by the other four judges that a particular indi- 
vidual is highly competent. 
(B) Averaged ratings of these four judges indicating that an indi- 


- vidual is above or below an arbitrary standard of medium competence. 


(C) Agreement of the four judges that an individual is above a 
minimum acceptable level of professional competence. 

Biserial correlations relating the ratings of the first judge (the test 
scores) to success or failure in passing the three criteria have been 
calculated; the coefficient is r — .76 for (A), r — .71 for (B), andr 
== .75 for (C). The degree of success of the unselected group in 
criterion (A) was 18.8 percent, in criterion (B), 56.3 percent, and in 
criterion (C), 83.3 percent. These data have been used to enter each 
of the eleven Taylor and Russell selection-ratio tables in order to secure 
the information needed to construct the theoretical curves shown in 


- Figure 2. In order to check the applicability of these tables to the present 


situation, actual experimentally determined values for the different 
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Fic. 2 (left). Amount of criterion success in a selected group with a difficult criterion (A), a moderate criterion (B), and an easy 
criterion (C) in relation to the proportion of individuals discarded by the test. The smooth curves are theoretical; the plotted points 
were experimentally determined. Fic. 3 (right). Amount of criterion failure in the rejected group from Figure 1. Note that the 
vertical axis has been inverted; purity of the tray group can be attained only if a small number are rejected. The validity 
coefficients in both figures ranged from r = .71 to r = .76. 
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degrees of selection afforded by the rating scale have been calculated by 
the method described in connection with Table I, and have been plotted 
in Figure 2 (the fit of the experimentally determined points to the 
theoretical curves is exceptionally good; it cannot be expected that data 
on only 48 individuals will typically show such high agreement). 

Referring to the left-hand curve of Figure 2, it may be seen that 
it tends to be comparatively vertical for small amounts of selection. 
Really good purification of the selected group can only be expected if 
a large proportion of individuals are discarded. This tendency is charac- 
teristic of curves for low unselected criterion success and would have 
been more striking had the unselected success been less than in this 
example. The middle curve tends to be almost straight, implying that 
the degree of purification of the selected group is linearly proportional 
to the proportion of individuals discarded. This tendency is characteris 
tac of a situation where unselected success approximates 50 percent and 
the validity coefficient is fairly high. With lower validity, the 50 percent 
curves tend to be more vertical with a knee of right-hand curvature at 
the upper end. The right-hand curve of Figure 2 clearly demonstrates 
that with a high degree of unselected criteriot success, the greatest 
amount of purification of the selected group is achieved by discarding 
relatively few individuals; in other words, the selection efficiency here 
obeys a law of diminishing returns. This is typical of situations involving 
75 percent or higher unselected criterion success, particularly if the 
validity coefficient is fairly high (i.e., greater than r — .60). 

For use of the judges’ ratings in counseling, the essential informa- 
tion is shown in Figure 3. The theoretical curves in this figure were 
constructed from the tables by a method that will be described pres- 
ently; the experimentally determined points were calculated by the 
method described in connection with Table II. Consider first the right- 
hand curve, which concerns the criterion involving a high standard of 
success that was attained by only 18.8 percent of the unselected group. 
Men who are given the lowest rating are sure to fail this criterion and 
men given either of the two lowest ratings (a category which includes 
60 percent of the unselected group) are found to show a 97 percent 
incidence of failures. The effectiveness of the test improves as the number 
of men rejected is decreased, but the increase in efficiency obeys a law 
of diminishing returns. In this situation counseling is effective; the “test 
misses” are in the selected group (curve A of Figure 2) rather than in 
the rejects. 

In contrast, the efficiency with the criterion involving a low stand- 
ard of success that was attained by 83.3 percent of the unselected group 
obeys a law of increasing returns as the number of men rejected is 
decreased; however, the effectiveness never gets high enough to justify 
the use of the test for counseling the rejects in this situation (except 
perhaps in the case of the poorest three or four percent of the unselected 
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men). This result occurs even though the rating test had a validity of r 

.75, considerably higher than is usually encountered in aptitude 
tests. The reason for the poor results is of course explained by the fact 
that the “test misses” are now in the rejected group; this is why the 
test was effective in purifying the selected group with respect to success 
in the 83.3 percent criterion (curve C, Figure 2). 


When the rating test is used in connection with criterion situation B 
(56.3 percent of unselected success), the efficiency is approximately 
equal with respect to counseling the rejects and improving the selected 
group; here, the test misses are divided almost-equally between the 
selected group and the rejects. Furthermore, efficiency tends to be a 
linear function of the smallness of the selected or rejected group (middle 
curves of Figures 2 and 3). 


Comparison of Figures 2 and 3 shows that the most effective place- 
ment of a “cutting” score is very different, depending on whether the 
test is to be used for improving the selected group or counseling the 
rejected group. In the case of the 18.8 percent or low-success criterion 
(curves A), improvement of the selected group requires a high degree 
of selection; efficiency increases rapidly in the upper regions of the 
curve. But if the test is used for counseling the rejects, a relatively low 
cutting score is best, because the efficiency increases very slowly in the 
upper regions of the curve. 


In the case of the 83.3 percent or high success criterion (curves C), 
a fairly low placement of the cutting score is desirable if the test is 
being used to improve the selected group, because efficiency changes 
very slowly in the high regions of the curve. On the other hand, if the 
test is being used for counseling the rejects, it is absolutely worthless 
unless the cutting score is placed very high; efficiency increases rapidly 
in the upper regions of the curve. It seems relevant to mention at this 
point that, in general, counseling of rejects is concerned with criterion 
situations in which unselected success is well above 50 percent, and 
curve C is probably a fairly typical curve. 


Several other examples of the agreement between the theoretical 
curves and experimental results are shown in Figure 4. In connection 
with a study of exercise at high altitude,? 26 men were given a synthetic 
38,000 ft. “flight” in a decompression chamber while engaging in mus- 
cular activity. Subsequently the same men were given another “flight” 
with a different exercise. Self-ratings of the severity of the altitude pain 
experienced by these men on their first exposure may be used as altitude 
tolerance scores; individuals who had no symptoms of decompression 
sickness during the second exposure may be considered successful in 
their second or validating flight, while the men who had symptoms may 
be considered as having failed the criterion. The unselected men showed 
23.5 percent criterion success. The biserial correlation between test 
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scores and criterion results is r —= .64. This information, with the aid 
of the tables, makes it possible to construct curve D. 

In another high-altitude experiment, 148 men were “flown” at 
33,750 ft. and scored as to the time of the first appearance of altitude 
pain. The presence or absence of symptoms of decompression sickness 
in a subsequent exposure was noted. The unselected criterion success 
was 36.6 percent, and the validity coeffcient was r — .68. These data 
yield the middle curve of Figure 4. The third altitude experiment, on 
134 men, made use of pain intensity scores in the first “flight.” Failure 
in the second or criterion flight was defined as the occurrence of intoler- 
ably severe symptoms. The unselected group showed 77.2 percent success 
by this criterion; the validity coefficient was r — .52. Curve E shows 
the efficiency of the test. In all three of the experiments, the actual 
occurrence of criterion success at various degrees of selection has been 
calculated as in the Table I data, and plotted on the graph. 

It will be noted that the agreement between theory and experiment 
is exceptionally close in the case of curve E of Figure 4. This is a very 
interesting observation because the test scores in this example were highly 
skewed, in contrast to the relatively normal distributions of the scores in 
D and F. Comparison of the agreement between theory and experiment 
in over a hundred curves similar to these!+4 indicates that the method 
can be used with skewed distributions, at least for validity coefficients 
smaller than r — .80, provided that biserial correlations are used. Such 
usage is of course a violation of the assumption of a normal bivariate 
correlation surface although the statistical theory is based on this assump- 
tion.® 7 Practically, the most important assumption appears to be the 
postulation that the correlation between test and criterion is uniform 
throughout the range of scores. On the whole, the theory appears to 
underestimate slightly the amount of improvement in the selected group. 


THEORETICAL TABLES AND CURVES 


Most accurate are the Pearson tables® of the volume of the normal 
bivariate surface, familiar to statisticians for their application in calcu- 
lating positive and negative tetrachoric correlations. In the present appli- 
cation, Pearson's Table VIII may be used for estimating improvement 
in the selected group and Table IX may be applied to the rejects. Unfor- 
tunately, these tables are not arranged in the most useful form for the 
present purpose; furthermore they will not be very intelligible to most 
non-mathematicians. The Taylor and Russell selection-ratio tables? are 
easier to understand and have adequate accuracy for most purposes. 

The writer has prepared a set of tables* of the form shown in Table 
III that will be found useful if the problem of testing is visualized along 
the lines of the present paper. An advantage of this form is that a 
single table serves to give the purity of both the selected group and the 
rejects. To illustrate: consider the situation for r — .60 with 20 percent 
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unselected failure in the criterion. Enter the 20% column (third from 
the right) to find the percent failures in the selected group as a function 
of the percent of unselected men discarded by the test (extreme left- 
hand column). For example, if the 10 percent of individuals making 
the poorest test scores are discarded, there will be 16 percent failures, 
in the selected group; if 20 percent are discarded, there will be 13 percent 
failures; if 30 percent are discarded, there will be 10 percent failures, 
and so on. 

In the unselected group, with 20 percent success there were 80 
percent failures. Enter the 80% column, but this time from the top 
dewnward and read 40 percent success in the rejects if 10 percent of 
the unselected group is discarded, 50 percent success in 20 percent dis- 
cards, 57 prcent success in 30 percent discards, and so on. For any entry 
in the body or margin of this table, a result given in percent success 
can be converted to percent failure by subtracting from 100, and a 


result given as percent failure can be converted to percent success. by 
subtracting it from 100. 


TABLE III 


(From Henry, 1946*) 
TEST EFFICIENCY FOR r = .60 


PERCENT FAILURES IN THE SELECTED GROUP AS A FUNCTION OF THE PERCENT 
OF THE UNSELECTED GROUP THAT IS DISCARDED.* 


90% (10%) 61 40 26 17 #4210 6 3 1 0 0 
80% (20%) 70 50 36 25 16 = #10 5 2 1 0 
70% (30%) 75 57 42 #31 «21~=«13 8 4 1 0 
60% (40%) 79 62 48 36 25 #17 += « 10 5 1 0 
50% (50%) 82 66 53 40 30 20 = «13 6 2 1 
40% (60%) 84 70 57 45 34 24 #15 8 3 1 
30% (170%) 86 173 60 49 38 27 #18 = «10 4 1 
20% (80%) 88 76 64 52 41 31 #21~=« 13 5 2 
10% (90%) 89 78 67 56 46 35 25 16 7 3 


0% (100%) 990 80 70 60 50 40 30 20 #10 5 


*The table also gives the percent success in the discarded group as a function of 
the percent of the unselected group that is discarded (use figures enclosed in 
parentheses). 


It has also been found useful to prepare a set of curves for each 
magnitude of correlation coefficient by 10’s. Such a family of curves 
is shown in Figure 5. Graphic interpolation can readily be made for 
any amount of unselected success, while the use of two magnitudes of 
correlation in each graph makes it easy to interpolate between a range 
of ten points in correlation. Confusion may be reduced by lightly shad- 
ing with a pencik the areas bounded by each pair of lines. These graphs 
are not very accurate for extremely high levels of rejection, but the 
tables can be resorted to for the infrequent occasions when this region 
is being used. It may be noted that by simply relabeling the horizontal 
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axis of Figure 5 to read “Percent failure in rejects” and inverting the 
numbers on the vertical axis so that 100 is at the bottom and zero percent 
is at the top, the curves may be used to predict results with the rejected 
group. 

Inspection of Figure 5 with respect to the horizontal distances 
bounded by the pairs of lines reveals that alteration of a validity coeffi- 
cient by as much as ten points does not result in a very great change 
in test efficiency in a considerable portion of the possible situations. 
Variations in validity have a modest effect when unselected success is 
high; for medium amounts of unselected success, altering the validity 
coefficient has a considerable effect if half or more of the unselected 
individuals are discarded by the test. If unselected success is low, the 
effect is large when a large proportion is discarded. These observations 
apply to purification of the selected group. In the case of rejects, altera- 
tion of the validity coefficient has the greatest effect when unselected 
failure is high; when less than half of the unselected individuals are 
rejected if unselected failure is in the region of 50 percent; at low 
levels of rejection when unselected failure is infrequent. 

When a test is used for practical purposes, its true validity in the 
particular application is seldom known; similarly, the true proportion 
of unselected criterion success will not often be known with much 
accuracy. Nevertheless, some reasonable approximation of these statistics 
must be made if use of the test is to be justified by any method of evalua- 
tion. Using such approximations it should always be possible to decide 
at least whether to use the right-hand, middle, or left-hand parts of 
Figure 5 (or a similar figure for some other range of validity coeffi- 
cients). This will provide a rational basis for correctly placing the 
“cutting” score, and, equally important, will serve to prevent injudicious 
placement. If the cutting score is fixed by policy, as would be the case, 
for example, if a fixed number of professional students were accepted 
from all applicants for training, or if it were customary to fail a certain 
proportion of students in a large class, it is a simple matter to estimate 
the reduction in failures fairly accurately even if unselected success and 
validity are approximations. This is possible because adjacent curves in 
diagrams such as Figure 5 tend to be similar in form. Obviously the 
above arguments apply equally well if interest is centered in the counsel- 
ing situation. 


SUMMARY AND CONCLUSIONS 


Analysis of a scatter diagram of individual test scores plotted 
against criterion results reveals that the efficiency of a test varies‘ widely 
depending on the proportion of individuals selected by the test and the 
unselected incidence of success and failure, even though the validity 
coefficient remains constant. If the amount of unselected success is high 
and a comparatively small proportion of individuals are selected, “test 
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misses” tend to fall in the rejected group, making the test efficient for 
improving the selected group and inefficient in purifying the rejected 
group. However, if the proportion of unselected success is low, the 
“test misses” tend to fall in the selected group. This results in low test 
efficiency insofar as the selected group is concerned, but efficiency is 
high with respect to purifications of the rejected group. When unselected 
success and failure are about equal, the “test misses” tend to be dis- 
tributed equally between the selected and rejected groups. 


Using the theoretical volumes of the normal bivariate correlation 
surface, selection-efficiency curves may be constructed for specific pro- 
portions of unselected success and magnitudes of test validity. These 
curves are found to agree very closely with experimentally determined 
test results. They may be arranged to give predictions of criterion success 
(or failure) in the selected group resulting from various percentages of 
selection, or to give predictions of success or failure in the rejected group. 
For practical purposes, it is found convenient to use superposed curve- 
families for two magnitudes of correlation coefficient. This technique 
permits rapid interpolation of both correlation and criterion success 
with sufficient accuracy for most applications. Simplified test-efficiency 
tables are also valuable; an example of such a table is shown. The method 
is useful for test evaluation even though validity and unselected success 
are only approximately known. 

Study of the selection-efficiency curve system leads to the follow- 
ing generalizations: 

1. The incidence of criterion success and failure in the selected and 
rejected groups can be determined for any specified cutting score on a 
test, if the validity coefficient and urselected incidence of success or 
failure are known. 

2. For any particular magnitude of the validity coefficient, a large 
improvement in the selected group results from discarding relatively 
few individuals, and further discarding obeys a law of diminishing 
returns, when the incidence of unselected criterion success is high. 

3. When there is a low incidence of unselected success in the 
criterion, improvement obeys a law of increasing returns, but a large 
proportion of the unselected group must be discarded before there is 
much improvement in the selected group. 

4. When unselected success is low, there is a high degree of purity 
of the rejected group when relatively few individuals are discarded by 
a test, but the rejects become mixed with an increasing proportion of 
“misses” as more and more individuals are discarded. 

5. With high unselected criterion success and relatively few indi- 
viduals discarded by the test (the most desirable situation for improving 
the selected group), the rejected group is extremely impure if more 
than a few percent are discarded. 
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6. When unselected success is close to 50 percent, improvement 
of the selected group tends to be linearly proportional to the percent 
of individuals discarded. In contrast, the rejected group tends to become 
increasingly impure in linear proportion to the percent discarded. 

7. Alteration in the magnitude of a validity coefficient has rela- 
tively little effect on the improvement of the selected group when 
unselected success is low and few individuals are discarded; the effect 
is greatest when over half are discarded and unselected success greater 
than 50 percent. On the other hand, the purity of the rejects is most 
affected by alteration in validity when unselected success is low and rela- 
tively few individuals are discarded. 

8. All these effects are small at low levels of validity, but become 
quite important for validity coefficients that are large enough to justify 
the use of a test in a practical situation. 


BIBLIOGRAPHY 
1. Henry, F. M., “Altitude Pain,” J. Aviat, Med. 17:28-55 (1946). 


oo. eee “The Role of Exercise in Altitude Pain,” Am. J. 
Physiol., 145:279-284 (1946). 


3. mune Statistical Methods and Tables of Test Efficiency. 
2 ad Un iversity ‘of California, 1946 (Mimeographed—available on request). 


ROE BRS and A. C. Ivy. “Preselection Tests,” in Decompres- 
sion po J. F. Fulton, ed. In press, 1947. 


5. Johnson, H. M., “Maximal Selectivity, Correctivity and Correlation 
Obtainable in 2x2 Contingency-tables,” Amer. J. Psychol. 58:65-68 (1945). 


6. Pearson, K. Tables for Statisticians and Biometricians, Part 2. London: 
Cambridge University Press, 1931. 


7. Taylor, H. C., and J. R. Russell, “The Relationship of Validity Co- 
efficients to Practical Effectiveness of Tests in Selection,” , 
23:565-578 (1939). 


Appl. Psychol., 








pr a Deh 


i ea Lie 


4 
| 
i 





er 











Hidden Possibilities For Research In 
Physical Education and Recreation 


By FLORENCE STUMPF AND FREDERICK W. CozZE?'s 
University of California 
Berkeley 


(Submitted for publication, January, 1947) 


HE original impetus for writing this article might be traced to the 

reading of a book, a book that formed just another link in that 

seemingly endless chain of books one should read to “keep up” 
with the best educational thought of the day, a book that came into 
being as the result of the work of a committee appointed by the Amer- 
ican Council of Learned Societies. 1* 

This committee sounded a call to “specialists,” working in their 
own restricted fields, to attempt, from the point of view of their own 
interest, to achieve an “historico-philosophical” orientation; to awaken 
to the importance of setting their specific problem in a larger context, 
and thus to avoid meaningless particularity. While admitting that it 
is natural for man to orient himself first to his own culture, and his 
immediate geographic environment, they maintained that a liberal edu- 
cation should lead him to discover that his own culture has been power- 
fully influenced by other cultures in other portions of the globe. It 
should bring to him an awareness of the existence of other cultures 
and a desire to explore their distinguishing characteristics and their 
historical development. Finally such education should bring him also 
an awareness of the many factors, economic, social, political, geographic, 
and religious, which have been involved in the differentiation of one 
culture from another. The authors deplored the almost exclusive pre- 
occupation, in formal education, with our own culture to the neglect 
of the rich and ancient cultures of the Near East and Far East. They 
pointed to the global thinking generated by the recent war and wel- 
comed such thinking as a powerful corrective to this limitation. 


This set us to thinking, most seriously, of our own “specialty” 
and led us to attempt an objective examination of the literature in our 
field, to see how it would stand up against the challenge of provincial- 
ism, in view of this concept of a world-wide orientation. 


The results, in terms of information available on what has hap- 
pened in relation to the use of our particular educational tool outside 





This manuscript is intended as a preface to a series of articles illustrating the pos- 
sibilities as outlined in the paper. Though it is not research in the strict sense, it is pub- 
lished as an introduction to research. The first article will deal with the role of games, 
sports, and recreational activities in the cultural pattern of a primive people. 


*Superior figures refer to numbered bibliography at end of article. 
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that part of the globe which might be called the domain of Western 
civilization, were disappointing indeed. Scattered bits of information 
on the work of the Y.M.C.A.’s and Y.W.C.A.’s, fragmentary com- 
ments of occasional visitors, a few fine pieces of work done by citizens 
of other parts of the globe, studying in the schools of our country, 
make up the sum total of the results. 


And yet, stepping out of the confines of our own literature, our 
search has led us to an untold wealth of material collected by earnest 
scholars and investigators in other fields, fascinating, challenging, pro- 
vocative material, awaiting the attention of students with initiative, 
imagination, and tenacity. The course is not charted, true, but enough 
guides may be found to keep the ambitious adventurer from losing 
his way. 

The archives of research material in ethnology and anthropology 
alone contain enough to engage the energies of interested students for 
years to come. The most cursory examination of the number of fine 
studies done by ethnologists and anthropologists on the play life, games, 
amusements, and ceremonial dances of primitive peoples, will convince 
the most skeptical, unless he be utterly lacking in perception, of the 
possibilities inherent in such studies with relation to our own field. 

A study of the literature dealing with the recreational life and 
physical activities of the peoples of the Pacific will provide for the 
physical educator and the recreation worker unexcelled laboratory ma- 
terial in social phenomena. 


Is it not possible in our search for the answers to the “worthy 
use of leisure time” that we can learn something from the Polynesians? 
Keesing refers to them as having “the highest standard of leisure the 
world has ever known.”? The recreational activities with which these 
people occupied their leisure, and the contributions these activities 
made to their total living, conform to many standards which we set 
up for our own programs. It was recreation for all, not for a few. 
Activities were closely integrated with actual living.. Participation 
was the role of each individual in his own small way, and the effect 
was a powerful socializing influence tending to promote group soli- 
darity and establish a definite feeling of “belonging” in all. 

The importance of the social value of inter-village and inter-com- 
munity competitive games was very great. Young and old were brought 
together as a group and all efforts were made to make each competition 
a truly festive occasion. It was an obligation on the part of the local 
people to provide food for the visitors, spectators as well as competitors, 
and when the games were played at another village this hospitality was 
repaid and amplified if possible. Polynesians attending the sports events 
of people of a so-called higher culture are struck by the lack of 
organized hospitality in providing free food and drink, a custom so 
characteristic of their own culture.3 














$ Wun 5 
FOSSA, fear 


Ss * “- oes agin elcome 
degen me Sil a i RR eee He es, rt a AA 














me 


Sng IPSEC! 











106 RESEARCH QUARTERLY 


Surely it is no less the business of the teacher of physical edu- 
cation and the recreation worker than it is that of the anthropologist 
to study human society and culture, to learn the way in which human 
beings interact in society, to learn the nature and function of social 
organizations, and means and effects of culture contact and change. 
He will learn that specific physical activities and recreational pur- 
suits vary from society to society, but that such activities are common 
to all societies and serve certain constant functions. Recreational pur- 
suits held in common by a social group serve to enhance the social 
consciousness of that group and also to give psychological security to 
the individual. Malinowski* has said that such forms of relaxation 
and artistic stimulation of the nerves and muscular system are a con- 
dition of healthy communal life and also fertilizing factors in cultural 
development and progress. 


Before proceeding we feel it is only fair to post certain storm 
warnings. In our judgment certain prerequisite qualities should be 
present in the hopeful research worker in this field. First he must 
have sufficient sociological understanding to realize that “man does 
not live by bread alone”; and that after the basic needs of food, cloth- 
ing, and shelter, and that other need of humans to explain the im- 
ponderable mysteries of life and death have been met, there still re- 
mains the surplus of time and energy to be spent on the things which 
give life joy and color and meaning. 


He should possess the fundamental conviction that play is as uni- 
versal a part of the cultural patterns of the world, primitive or mod- 
ern, as the economic, familial, religious, or political life. He should 
realize further, that to minimize, ignore, or pervert this fundamental 
human need is dangerous. We need not search far in our own culture 
to find evidence that when children are allowed to proceed along the 
road to maturity without proper training and guidance for the pur- 
suit of their own recreational life, both while they are children and 
after they become. adults, we have a multitude of social ills that are 
much more difficult to correct than they might have been to prevent. 
Among adolescents we have increasing juvenile delinquency and an 
unhealthy preoccupation with commercialized forms of recreation which 
do not provide that outlet for creative energy that comes from active, 
interested participation. Among adults we have lethargy, boredom, 
neuroses, and over-indulgence in many undesirable forms. 

The student who would attempt this sort of research must be 
free from what a well-known anthropologist ® has termed “ethno- 
centrism,” a form of cultural myopia that limits the mental vision and 
cultural horizon to the confines of one’s own society. He must have a 
sufficiently developed understanding to comprehend and appreciate 
that man in his varied career in this world has worked out an endless 
variety of adaptations to life’s problems, and that through a sympathetic 
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comprehension of the answers found to these problems in other so- 
cieties we may be in a better position to understand our own society, 
its contradictions and failures, its achievements and its values. 

Here is a challenge to students and teachers who are vitally 
interested in games, sports, recreational activities, as an integral part 
of the culture of all peoples; who are eager to seek out and under- 
stand the basic sociological significance of the aspect of culture:® 
students who are convinced that all human activities are interrelated, 
and that none is so trival that it may not influence or illustrate others 
which may seem more important; that while the food they eat and 
the tools they use may determine the chances for survival of a people, 
the games they play or the stories they read may establish the character 
of a nation.? 

Here is research to be done by individuals who realize that under- 
standing between peoples forms the basis of our hope for the future of 
civilization,® and who have the firm conviction that the tools of 
our profession in the hands of skilled and sincere teachers may con- 
tribute in full measure to the building of such an understanding. 

Such research needs students who are curious, who want to seek 
out the significance of such a fact as is represented in the knowledge 
that the National Council of Physical Education in Chile is placed un- 
der the immediate control of the Ministry of War, while in Bolivia the 
General Bureau of Physical Education is under the control of the 
Minister of Public Instruction. Is the difference in control one of 
administrative expediency or is it evidence of a different philosophical 
approach? 

What part has instruction in physical education activities played 
in the “cultural mission” experiment in education in Mexico? Do we 
find in this experiment any implications for the solution of our prob- 
lems in rural education, particularly in economically depressed areas? 
Could this be the answer to instruction in physical education in our 
many thousands of one-room, one-teacher schools?1° 


What part is our profession and its thousands of members to play 
in the future international role of the United States Office of Educa- 
tion?!1 How are the tools of our profession viewed by the United 
Nations Organization?! How will physical education activities 
and preparation for “worthy use of leisure” enter into the plans of 
UNESCO? Because we believe that physical education should form 
an integral part of any scheme of education, and, in its broader aspects 
including recreation and rehabilitation, an important instrument in the 
total picture of civilization, the members of our profession will fol- 
low with great interest the deliberations of the representatives at 
UNESCO. 

The possibilities of certain types of historical studies in our field 
have been briefly suggested. If a complete history of physical educa- 
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tion is ever to be written, we must take cognizance of the vast geo- 
graphical areas of the world which as yet are relatively untouched in 
our literature. The historical record of the recreational life and physical 
activities of the many millions of human beings outside of our western 
civilization cannot be ignored if we hope to be able to trace the 
intricate interrelationships of our field with other social phenomena. 
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A Factor Analysis of Some Cardiovascu- 
lar-Respiratory Variables and Tests 


By LEONARD A. LARSON 
New York University 
New York City 


(Submitted for publication, January, 1947) 


HREE studies have been reported on the analysis of the interrela- 

tionships of cardiovascular-respiratory variables. Two of these 

studies were on college men*: 8* and one on college women.* The 
purpose was to analyze the table of intercorrelations using the Thurstone 
Factor Analysis Method® 1° 1! to determine the principal components 
constituting the table of intercorrelations or the components describing 
the cardiovascular-respiratory variables. When dealing with a number of 
variables one can assume that they are all unrelated and therefore there 
are as many abilities as tests; or one can assume that they are all per- 
fectly related and therefore there is only one ability or characteristic. 
Neither of these hypotheses is correct as indicated by the reported 
studies and some number between these extremes describes cardiovas- 
cular-respiratory function. 

McCloy® found in analyzing 14 cardiovascular variables that they 
were described by at least eight principal components. The variables 
included horizontal and standing systolic, diastolic, and pulse pressures; 
a comparison of pulse rate and pressures between the horizontal and 
standing positions; and pulse rate after exercise. Three of the isolated 
factors were identified as factors of pulse rate (heart rate, standing 
heart rate, and heart rate return to normal), and five as factors of blood 
pressure (general blood pressure, pulse pressure, reclining diastolic pres- 
sure, effect of position on systolic, and reclining systolic pressure). 

Utilizing 32 cardiovascular variables in examining a group of col- 
lege women, Murphy? found nine primary components. Three of the 
isolated factors were described as factors of pulse rate and six as blood 
pressure factors. 

In a study conducted by Quamme,® 20 cardiovascular-respiratory 
variables were analyzed by the factor method. Eight factors were isolated. 
Four were described as factors of pulse rate (circulatory recovery to 
exercise, effect of pulse rate to exercise, pulse rate return after exercise, 
and pulse rate response to postural changes), and four as factors of 
blood pressure (head or maximum pressure, diastolic changes with body 

The data for this study were gathered in the course aie research conducted under the 
auspices of the National Research Council Committee on Selection and Training of 
Aircraft Pilots with funds made available by the Civil Aeronautics Administration. The 


author wishes to acknowledge help given on statistical computation by Mr. H. J. Quamme. 
*Superior figures refer to numbered bibliography at end of ‘article. 
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position, diastolic pressure, and maximum pressure changes with body 
position). 


STATEMENT OF PROBLEM 


The purposes of this research are two-fold (1) to analyze some 
cardiovascular-respiratory variables for their principal components, and 
(2) to determine the physiological characteristics of ten selected cardio- 
vascular test index scores on the basis of the isolated factors. 


PROCEDURE 


Experimental_—Nineteen cardiovascular-respiratory variables were 
administered to 145 Springfield College men, the majority of whom 
were major students in health and physical education. Some of the 
variables were used to score ten cardiovascular tests. The total of 29 
variables and tests was used in the statistical analysis. The nineteen 
variables served as material for the isolation and identification of the 
factors. The ten test batteries were included to determine their physi- 
oiogic characteristics on the basis of the isolated factors. The following 
precautions were used in obtaining data for the statistical analysis: (1) 
exercise was not allowed before the examination (during that day); 
(2) subjects with emotional disturbances were not used; (3) subjects 
not receiving a normal amount of sleep the night previous to the exam- 
ination were eliminated; (4) subjects with heart defects (medical exam- 
ination) were not used; (5) subjects having a heavy meal just before the 
examination were asked to return in one hour; (6) subjects with colds 
were not used; and (7) the subjects selected represented a physically fit 
group of young men (18-24 years of age), all of whom participated in 
physical education activities as part of their education. The measurements 
were all made in one week in September. All questionable results were 
rechecked and were dropped or retained on the basis of the above criteria. 


Statistical—_The Thurstone Method of Factor Analysis!%11 was 
used in the isolation of the factors. This method begins with a table 
of intercorrelations (406 rectilinear correlations in this problem) and 
results in the isolation of a number of factors or components which con- 
stitute the major part of the tests analyzed and gives a correlation 
between the factor and the tests used in the analysis. The method assumes 
that the tests and variables are related and that they can be described 
by a relatively small number of factors. The name given to the factor, 
or identification, is determined by how much one knows about the tests 
and the field within which he works. The factor method involves no 
assumption as to the nature of the factors. In this research two physiolo- 
gists have reviewed the statistical results and have given suggestions as to 
the nature of the factors isolated. 
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Limitations.—In the interpretation and application of the results 
of this research, the following should be considered: 


1. The variables and tests selected cannot be assumed to indicate 
a complete measure of cardiovascular-respiratory functions. 

2. The isolated factors and their identification are. based on the 
variables (not test batteries) used in the study and on the judgment of 
physiologists. 

3. The sample consists of Springfield College men from 18 to 24 
years of age, all of whom participated in physical education activities. 

4. The factor analysis studies reviewed cannot be directly compared 
because the variables are not identical, the system of rotation and the 
identification of the factors involve judgment skill, that is, the best rota- 
tion may not be made and the factors may be identical but given differ- 
ent names, and the data are on different institutional populations and 
on men and women. 


5. The variables and tests are not all completely described by the 
isolated factors. Factors will appear only if they are present in at least 
two of the tests analyzed. Because some of the tests are not completely 
described by the isolated factors, the implication is that other factors 
are possible. 

6. The ten test batteries are described by the isolated factors in 
terms of their “index scores.” 


7. The reliability range of the cardiovascular-respiratory variables 
and tests is .70 to .95 which means that a high degree of confidence as 
to the stability of the factor cannot be made in some instances. 4 


RESULTS 


The results of the factor analysis of the table of intercorrelations 
(Table I) show that there is no common factor, that is, a factor common 
to all variables and tests used in this analysis (Table II). The eight 
isolated factors are described by independent clusters of variables, which 
give evidence that cardiovascular-respiratory variables are sufficiently 
independent or unrelated so that at least eight factors are necessary to 
describe cardiovascular-respiratory function. The eight factors cannot be 
considered entirely independent or entirely unrelated, but they are not 
sufficiently related for cross prediction. 


With the aid of two physiologists* the eight isolated factors were 
analyzed and physiologically described. The single variables were statis- 
tically analyzed to determine their “net” contribution to each factor. 
Those yielding significant “net” contributions were selected as predictive 
of the factor. The multiple correlations (Table III) indicate the signi- 
ficance of the prediction and were used as an aid in the identification of 


*The writer wishes to express appreciation to Dr. P. V. Karpovich and Dr. Percy 
Dawson for their description of the isolated factors. 
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29° —.1392 —.1416 —.0150 


0974 —.1419 .2983 — 





6899 6424 —.3963 


-1050 .2552 


8 9 10 i 12 
4443 iB: 
-.2715 “4445 


0894 1095 100! —.0010 
0447 .0298 .0952 —.0692 .4084 
0202 .1456 .1022 0293 —.0545 


.0533 


1415 —.2355 —.0641 —.1744 .1168 


0924 .4765  .4077 


.2137 —.0239 





0006 .2077 =.4382 —.1757 —. 1406 














0551 1949 —.0618 3133 —.1131 
1768 .2353 5974 —.2679 0623 
0536 .2671 4173-1336 0317 
8069 0940 —.4258 4825 0377 
TTR 4347 —.2560  .7007 —.0126 
6122 3782 —.5920 .8949 —.0280 
6097 596 —.3304 8829 —.0076 
5917 3957 —.5638 .8594 —.0299 
0418 5619 3357 .2430 —.0640 
Oll4 4760 2706 0654 0567 


1696 0719 


2109 —.1584 4596 


— 





—.1% 


—,14) 


—.110 

139 
—.116 
—.010 
—.175 


—.168 


—. 167 
—051 
082 


.600 
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TABLE I (cont.) 
ZERO-ORDER INTERCORRELATIONS OF CARDIOVASCULAR-RESPIRATORY 
VARIABLES AND TESTS 

7 2B 14 5 é 17 18 19 2 2342 22 23 24 25 a. 2 2 

*Hor., Horizontal er 
— —— P.R., Pulse Rate 

P.P., Pulse Pressure an 
—— per Sit., Sitting 

Std., Standing wil 
—— D.P., Diastolic Pressure 

S.P., Systolic Pressure 
“—- a9 e B.H., Breath-holding - 

V.C., Vital Capacity Seas: 
—— Min., Minutes 

Sec. Seconds Bete es <ctihlidnsitaipstitacenile 
can | Ex., Exercise 
: -_ — — —E 
45 —.1902 Z tas 
bb —.1477 6437 2 
68 —.0452 .2214 —.2325 arti 
39 —0835 .1236 1831 
06 0608 —.0574 0431 .022 3391 . 
31 —1103 1242 .0341 %5  .4641 —.5613 
523 1397 —.5048 —.0232 —.4328 .2657 3601 —.1279 
317 —1165 .6423 .1999 .2828 .3703 .1523 .1912 —.2860 
377 —0103 0814 .0740 —.0773 —.1760 —.0195 —.1057 —.2958 —.0816 é 
126 —1752 6031 .1950 .2204 1860 —.1654 .3074 —.6709 6155  .4067 
280 —.1045 0804 .0907 —.1005 .0764 —.2709 .2914 —.3862 —.0552 5737 .6775 
076 ~1688 3810 .1766 043 2234 —.1696 3322 —.4911 .3952 5034 9166 -~—.8718 : 
299 —1677 29631144 0674 ~—-.0529 —.2919 3030 —.5903 1835 ©5676 «=.8499- 9336 = .9485 
“0 —0516 .0527 .2750-—.4641 8657 2749 4105 3605 1962 —.1083 .0829 1084 1743 0276 
se] 0828 —.5231 —.1359 —.3426 .4333 2771 0838 6579 —.3135 —.0711 —.3691 —.0039 —.1698 —.2116 .4940 
15% 6000 —.1434 —.0793 —.011! .0773 .1309 .0034 —.2545 —.0687 —.2483 —.2106 —.1548 —.1921 —.2428 .0509 .1619 
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ROTATED 
I II Ill IV V 

1. Hor. P.R.* 0820 .1272 7227. .2259 —.2236 
2. Sed. P.R. .0021 —.1955 .8287 —.1510 1311 
3. Hor. S.P. 3870 .1405 2173 —.2154 —.0440 
4. Hor. D.P. —.5050  .1763  .3468 —.4439 —.0182 
5. Hor. P.P. 6873 .0270 —.1109 —.5734 —.1006 
6. Sit. S.P. 3270 0704 .2963 —.0777 - .05 46 
7. Sit. D.P. —.5689 0393 3083 09 .2867 
8. Sit. P.P. 7601 —.0670  .0522 —.1277 —.2780 
9. Std. S.P. 2208 —.0381 .3735 —.0368 —.0997 - 
10. Std. D.P. —.6766 —.1094  .2792 —.0857 —.3573 
11. Std. P.P. 8584 0847 2023. 05951358 
12. V.C. —.1098 .0459 —.0671 —.2287  .2128 
13. B.H. aft. Ex. near oor ae 1346 —.2453 —.0765 .1587 
14. PLR. ERLE eng Sire 

2 min. aft. Ex. 0263 3569 =.8508 —.2126 —.1475 
15. Diff: Std. rg 

P.R. & P.R. 

2 min. aft. Ex. 0836 5975 3834 1455 .2697 
16. Diff: Hor. Sed. P.R. —.0875 —.3781 1732 —.3767  .1591 
17. Diff: Hor. Std. $.P. —.1125 —.2484 .3630 2351 —.1758 








. Diff: Hor. Std. D.P. —.1803 —.2912 —.0584 —.5589 —.4315 








. Diff: Hor. Std. P.P. 


-0681 —.0188 3385 7491 —.2961 









TEST BATTERIES 












































VI 
.1689 
1292 
2701 
2597 
0529 
2087 
3096 
0292 


0733 


—, (917 


.0131 


3580 


§§95 


.1002 


O15 
.0457 
.2718 
1703 


.0406 


1234 


.0903 - 


.1180 
.0660 


0604 


0501 


20. McCloy Test —.4419 0274 —.4439 1043 —.3641 — 
21. Barach Test —1159 —.2267 .8486 —.1207 —.0296 
22. Stone Test Kye 7490 ~,0678 —.0412 —.0677 iol 3174 - 
23. Basal Metab. Test | .6073 —.1021 .7572.—.0541 .1890_ 
24. Tigerstedt Test .9091 0702 .1060 =.1086 ; 2338 
25. P.P.xP.R. (Std) | .7877 —.0414 5283  .0050 1678 
26. P.P. x P.R. St eee 

+ Dias. (Std.) 8875 0094 3557 0313. .2471 
27. Crampton Test —.0420 0183 .2304 § .3653 —3733 


.2558 








. Schneider Test 


—.0852 —.1408 —.3863 .1733 —.2261 








\ pute 
est 


Larson —.3429 .1684 —.1330 —.0804 








-3141 -4856 —.1899 


.1808 
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Factor LOADINGS OF CARDIOVASCULAR-RESPIRATORY VARIABLES AND TESTS 


Rotated Unrotated 
VE... Vi Ee he 
0403 —.2557 .7417 .7417 
1126 —.4465 .9937 .9932 
5237 .5028 8651 .8650 























3187 .2136 .8183 8181 
0134 “2560 8928 .8923 
4610 6490 .8873 8873 
0407 "5176 8748 8751 
.4620 2031 9341 9353 
0639 8471 9282 - 9288 
1502 4539 9198 9197 
0054 4498 1.0094 1.0092 
2752 "2287.—=.3725 3724 
3168 . 1734 ; $799 .5801 
0461 —.2464 9918 9914 
1968 .0732 6491 6488 
~.3773 —.3089 5885 5882 
4803 5433. 8921  .8920 
2675 2370 -.7761_—«.7757 
0640 ' 1489 i 7963 ~ 7961 
—.0019 .6098 .9237 9235 
-.1264 .0744 .8301 8298 
2946 .1332  .8099 .8094 
~.0391 0191 1.0070 1.0066 
.0888 .1999 9606 9611 





.0013 .2427 -9920 .9917 














.0375 .0550 9833 -9827 
-4519 .5613 -9126 -9126 
.2012 -6145 .6756 .6758 

-3181 -6418 


-6420 
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the factors. It will be noted (Table II) that the variables are not all 
completely described by the eight factors. This means that the tests 
which are not described are specific with respect to the variables in this 
experiment. Unless a factor is found in at least two tests, it cannot be 
isolated. 

The process of factor rotation involved only the single variables 
and did not include the test batteries. The test batteries were included 
only for the purpose of determining their physiologic characteristics on 
the basis of the isolated factors. 


Factor One.—Because of the high loadings (correlations) of sitting 
and standing pulse pressures (.76, .86) this factor is identified as pulse 
pressure. The multiple correlation of these two variables with the factor 
is .89. The horizontal pulse pressure correlates .69 and diastolic pressure 
(horizontal, sitting, and standing) also correlates significantly. The inter- 
correlations of diastolic pressure and pulse pressure, however, are not 
sufficiently high to predict one, given the other. Pulse pressure must 
therefore be considered as a single component when compared with the 
other cardiovascular-respiratory measures. 


Factor Two.—The physiologic variables correlating significantly with 
this factor are: pulse rate two minutes after exercise (.36), the differ- 
ence between standing pulse rate and pulse rate two minutes after 
exercise (.60), and the difference between horizontal and standing pulse 
rates (—.38). The multiple correlation of these variables is .65. This 
factor is therefore described as pulse rate response to exercise in relation 
to normal. 


Factor Three.—The variables correlating significantly with factor 
three are horizontal pulse rate (.72), standing pulse rate (.83), and 
pulse rate two minutes after exercise (.85). The “net” contribution of 
the horizontal pulse rate in the presence of standing pulse rate and pulse 
rate two minutes after exercise is insignificant. The multiple correlation 
of the remaining two tests is .89. The factor is described as pulse rate 
response to exercise. 


Factor Four.—The variables significantly related to the factor are 
horizontal diastolic pressure (.44), horizontal pulse pressure (—.57), 
difference between horizontal and standing diastolic pressures (—.56), 
and the difference between horizontal and standing pulse pressures (.75). 
Cn the basis of this evidence it is described as pulse pressure in response 
to postural change. The multiple correlation of these variables (elimi- 
nating horizontal pulse pressure) is .'78. 


Factor Five.—The physiologic variables correlating significantly are: 
sitting diastolic pressure (—.29), sitting pulse pressure (—.28), stand- 
ing diastolic pressure (—.36), difference betwen horizontal and standing 
diastolic pressure (—.43), and the difference between horizontal and 
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standing pulse pressures (—.30). Sitting and standing diastolic pressures 
have the largest “net” significance in predicting the factor (.74). It is de- 
scribed as diastolic pressure and changes with respect to body position. 


Factor Six.— Vital capacity and breath-holding 20 seconds after 
exercise correlate significantly (.36 and .56). These items yield a mul- 
tiple correlation of .58 and are described as the respiratory factor. The 
multiple correlation is not significantly high to effectively predict the 
factor; however, the items are not significantly related to the other 
components; therefore, it can be considered another aspect of cardio 





vascular-respiratory function. 


Factor Seven.—The items correlating significantly are horizontal 
systolic pressure (—.52), horizontal diastolic pressure (—.32), sitting 
systolic pressure (—:46), sitting pulse pressure ( 


TABLE III 


.46), and difference 


MULTIPLE CORRELATION OF VARIABLES WITH THE FACTORS 





Horizontal Pulse Pressure 
Sitting Pulse Pressure 


_Standing Pulse Pressure 


Pulse Rate 2 Min. after Exer. 
Diff. in Std. Pulse Rate and 
Pulse Rate 2 Min. after Exerc. 


. Diff. in Hor. and Std. Pulse Rate _ 








. Pulse Rate 2 Min. after Exerc. 


Hor. Diastolic Pressure 
Diff. in Hor. and Std. Diastolic 


Factor Variable in the Factor* 
I 5. 
Pulse Pressure 8. 
af. 
II AA aii 
Pulse Rate Response 14. 
to Exercise in Rela- 14; 
tion to Normal 
16. 
III ° 
Pulse Rate Response 1. Hor. Pulse Rate 
to Exercise 2. Std. Pulse Rate 
14 
IV Tie 
Pulse Pressure in 4. 
Response to Postural 18. 
Change Pressure 


Diff. in Hor. and Std. Pulse Pres. _ 





Vv 
Diastolic Pressure and 
Changes with Respect 
to Body Position 


Sitting Diastolic Pressure 
Standing Diastolic Pressure 


. Diff. in Hor. and Std. Diast. Pres. 
Diff. in Hor. and Std. Pulse Pres. _ 





VI 
Respiratory Factor 


VII 





to Changes in Posture 


. Vital Capacity 


Breath-Holding 20 Sec. after Exer. 





. Hor. Systolic Pressure 
Blood Pressure Response 4. 
. Sitting Systolic Pressure 


Hor. Diastolic Pressure 


Sitting Pulse Pressure 


Diff. in Hor. and Std. Systolic Pres. 





Vill 


Systolic Pressure 


6 
8 
17 
FE 
6 
9. 
17 


Hor. Systolic Pressure 


. Sit. Systolic Pressure 


Std. Systolic Pressure 
Diff. in Hor. and Std. Systolic 


Pressure 





__RV-7,10 = .744 | 


Multiple Correlations 
RI.5,8,11 = .894 
_RL8,11 = 891 


RIT.14,15,16 = .654 





RIII.1,2,14 = .893 
__RIT.2,14 = 892 


RIV.4,18,19 = .775 


RV.7,10,18,19 = .740 





/RVI.12,13 = .577 





RVII.3,4,6,8,17 = .855 
RVII.4,6,8,17 = .854 
RVII.4,8,17 = .854 
RVIII.3,6,9,17 — .864 
RVIII.6,9,17 == .864 
RVIII.9,17 = .862 


— 


*The Arabic number preceding the name of the variable corresponds to the numbers of the 
variable in Tables I and II. 
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between horizontal and standing systolic pressure (.48). The multiple 
ccrrelation between horizontal diastolic pressure, sitting pulse pressure, 
difference between horizontal and standing systolic pressure, and the 
factor is .85. It is described as blood pressure response to changes in 
pesture. The identification, however, is with a low degree of confidence 
because of the lack of sufficient degree of differentiation of the various 
pressures. 


Factor Eight—The variables correlating are all systolic pressures in 
the various positions (horizontal, sitting, and standing). Standing systolic 
pressure and the difference between the horizontal and standing systolic 
pressure correlate (multiple) .86. It is identified as systolic pressure. 


INTERPRETATION OF CARDIOVASCULAR TEST “INDEX SCORES” IN 
TERMS OF THE IDENTIFIED FACTORS 

One of the major purposes of this analysis is to determine the 
physiologic characteristics of the ten selected cardiovascular test “index 
scores.” The “index scores” were therefore included in the factor 
analysis. This procedure does not establish the validity of these batteries 
as validity can be established only by its evaluation with respect to an 
acceptable criterion, but it simply yields a physiologic description on 
the basis of the identified factors. The procedure also assumes that the 
isolated factors resulting from the analysis of the single cardiovascular- 
respiratory variables are correctly identified. An example illustrates the 
use of the factor method in the analysis of the test batteries. A cardio- 
vascular test battery is a complex phenomenon. We are interested in 
two characteristics of this battery, its validity and its physiologic charac- 
teristics. It is in the latter that the factor analysis procedure makes a con- 
tribution. If, for example, it is found that three factors correlate per- 
fectly with the test battery, it can be assumed that the test battery is 
fully described by the three factors, and if the factors are correctly 
identified, the description of the test battery is established. So it is in 
this analysis. If diastolic pressure correlates perfectly with a test battery, 
the test battery is physiologically the same as diastolic pressure, assuming 
that no other variables are reacting on both the test battery and diastolic 
pressure to produce the perfect correlation. If a large portion of the 
relationship is due to age, it would be hardly correct to say that the 
correlation is perfect between the diastolic pressure and the test battery. 
For a complete physiologic description it is necessary to know what con- 
stitutes the relationship. If the variables could all be assumed to be pure, 
this problem would be eliminated, but this cannot be assumed to be true 
in circulatory-respiratory variables. 

In Table IV the relationships between the eight isolated cardio 
vascular factors and the various test batteries have been listed in terms 
of significant correlations. It is noted that Factor One (Pulse Pressure) 
correlates significantly with five test batteries (Stone .75, Basal Metabolic 
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.61, Tigerstedt .91, P.P. x P.R. .79, and P.P. x P.R. - Dias. .89). The 
reason for such correlations is apparent as the tests have for their major 
constituent pulse pressure. The descriptions by the other seven isolated 
factors are hardly significant, with the exception of the Basal Metabolic 
Test and the P.P. x P.R. which correlate .76 and .53, respectively, with 
Factor Three (ptilse rate response to exercise). 


TABLE IV 


PHYSIOLOGIC DESCRIPTION OF CARDIOVASCULAR TESTS BY IDENTIFIED FACTORS 





Identified Factors Significant Correlations with Tests 


and Identified Factors 


























: cae __ Correlation 
1. Pulse Pressure 1. Stone 749 
2. Basal Metabolic .607 
3. Tigerstedt .909 
‘BP. s PR. .788 
ee eee! Se, ee ee “Te 888 
2. Pulse Rate Response to No significant 
Exercise in Relation to Normal 5 correlation 
3. Pulse Rate Response to 1. Barach 849 
Exercise 2. Basal Metabolic £757 
RAgk ie IS ee a 528 
4. Pulse Pressure in Response to 
Postural Change _—__1. Crampton 365_ 
5. Diastolic Pressure and Changes 1. McCloy —.364 
with Respect to Body Position 2. Stone —,.317 
3. Crampton —.373 
ee _4. McCurdy-Larson ‘ +358 
6. Respiratory Function 1. Crampton 256 
__——Sj.,~“MccCurdy-Larson pas 486 | 
7. Blood Pressure Response to 1. Stone -.295 
Changes in Position _ 2. Crampton ERar ‘oe 
8. Systolic Pressure 1. McCloy .610 
2. Crampton 561 
3. Schneider 615 
_ 4. McCurdy-Larson 318 


Factor Two (pulse rate response to exercise in relation to normal) 
does not correlate significantly with any of the ten cardiovascular test 
batteries. The factor is included in some of the test batteries but the 
influence of this variable on the index score appears to be lost in the 
summing process. This is an important and comparatively unrelated 
factor. 


The Barach Test (.85), the Basal Metabolic Test (.76) and the 
P.P. x P.R. Test (.53) all correlate significantly with factor three (pulse 
rate response to exercise). These tests all include this component as a 
major constituent. The factor is included in some of the other batteries 
(McCloy and Schneider), but the index scores of these tests do not 
reflect it. 
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The Crampton Test is the only test correlating with factor four 
(pulse pressure in response to postural change, .37). This test does not 
include the factor, but the slightly significant correlation is probably due 
to the relationship of the factor to systolic pressure postural change, 
which is a major element of the test. These two factors are related to 
approximately this degree. 


Factor five (diastolic pressure and changes with respect to body 
position) correlates slightly with four tests (McCloy, Stone, Crampton, 
and the McCurdy-Larson Tests). The signs are not consistent in the 
McCurdy-Larson Test, but this may be due to the system used in scaling 
the diastolic pressure. The high and low diastolic values receive a low 
index score in the McCurdy-Larson Test. The factor is rotated as a 
negative factor, and hence the positive relationship with the McCurdy- 
Larson Test. 


The respiratory factor (six) correlates significantly (.49) with the 
McCurdy-Larson Test and slightly with the: Crampton Test. The 
McCurdy-Larson Test correlates because of vital capacity and breath- 
holding ability 20 seconds after exercise; the Crampton Test, because of 
the relationship of pulse rate to the factor. The correlations with the 
other tests are zero. 


Factor seven (blood pressure response to changes in position) cor- 
relates significantly with the Crampton Test and slightly and negatively 
with the Stone Test. This is in accord with the nature of the factor and 
tests. The difference in the horizontal and the standing systolic pressures 
is a positive element of the factor and one of its significant elements. 
The Crampton Test includes this element, and also correlates positively 
with the factor. The horizontal systolic pressure has a negative correla- 
tion with the factor which indicates that, as the factor increases, the 
horizontal systolic blood pressure decreases. The sitting pulse pressure 
correlates with the factor, and this item is the major constituent of the 
Stone Test, and hence the negative correlation. 


Factor eight (systolic pressure) describes partially the McCloy Test 
(.61), the Crampton Test (.56), the Schneider Test (.61), and the 
McCurdy-Larson Test (.32). The correlations are significantly higher 
in the first three tests. The Crampton and the Schneider Tests include 
this factor as a major component. In the McCloy Test, however, the 
only account which can be made is the relationship of systolic pressure 
to diastolic pressure in the standing position. 


It will be noted in Table II that the eight isolated factors describe 
six of the ten test batteries from 90 percent to 100 percent,* two from 
80 to 90 percent, and two from 60 percent to 70 percent. It is also inter- 





*The factor loadings squared are interpreted as percentages. 
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esting to note that not one of the test batteries reflects all eight isolated 
factors. The tests correlating significantly with each of the eight isolated 
factors vary from one to five. The correlations, however, are not suffi- 
ciently high to predict the factor. It should be pointed out that the reason 
the tests may not reflect the various components may be because of the 
system of scaling. For example, diastolic pressure in the McCurdy-Larson 
Test is scaled according to the logical physiological interpretation; that 
is, the mean is used as the desirable value (best score) with the extremes 
as the undesirable values (low scores). The unscaled diastolic pressure 
will therefore not correlate with the scaled value, at least using the linear 
method. The index score, therefore, may have a higher degree of validity 
as an estimate of circulatory-respiratory function than is reflected by its 
correlation with the factor. 


IMPLICATIONS OF THE FACTOR ANALYSIS FOR TEST 
CONSTRUCTION 


The results of the factor analysis of the 29 variables and tests indi- 
cate two conclusions: (1) the selection of variables for the construction 
of the ten test batteries does not include all the factors describing cir- 
culatory-respiratory function, and (2) the “index scores” of the ten 
selected tests do not reflect the isolated factors of circulatory-respiratory 
function. It was also found that in some test batteries, where the factor 
represents a major part of the battery, the “index score” does not reflect 
it to a significantly high degree. This is because of the influence of the 
other items in the battery. It does not necessarily mean that these tests 
are invalid with respect to a specific objective, but it does mean that 
they are invalid as a complete index (in terms of the selected variables) 
of circulatory-respiratory status. 


The implications of these findings for test construction are three: 
(1) the selection of the items for the complete estimate of cardiovascular- 
respiratory function should include all factors of those systems; (2) if 
some of the factors are to be eliminated, this must be done in terms of 
a specific criterion, which represents the use to be made of the selected 
factors; and (3) the sum of a number of cardiovascular-respiratory 
measures may be a better index of a specific ability, but it does not reflect 
the factors which are constituent parts. With respect to the individual 
factors, the index score has no diagnostic value. In order to know the 
functional status of the individual each factor must be considered indi- 
vidually. If one is interested in knowing the circulatory respiratory 
status of the individual, all factors must be considered, with the sum of 
the scores, if desired, as yielding additional information. If one is inter- 
ested in describing a particular ability, the items should be selected 
which significantly describe it with a “weighted” sum prepared (regres- 
sion equation). 




















CARDIOVASCULAR-RESPIRATORY VARIABLES AND TESTS 121 


CONCLUSIONS 

1. A factor analysis of some selected cardiovascular-respiratory 
variables shows eight factors. These have been identified as pulse pres- 
sure, pulse-rate response to exercise in relation to normal, pulse-rate 
response to exercise, pulse pressure in response to postural change, 
diastolic pressure and changes with respect to body position, respiratory 
function, blood pressure response to changes in position, and systolic pres- 
sure. Other factor analysis studies (not comparable with respect to spe- 
cific factors) show a minimum of eight factors describing approximately 
the same selection of variables. i 


2. Utilizing the results of the factor analysis and in addition a 
logical analysis of the table of intercorrelations, the following variables 
were selected as being sufficiently unrelated as to be considered inde- 


pendent: 


a. Horizontal pulse rate. 

b. Difference between horizontal and standing pulse rates. 

c. Difference between normal standing pulse rate and pulse rate 
two minutes after exercise. 

d. Horizontal systolic pressure. 

e. Difference between horizontal and standing systolic pressures. 

f. Horizontal diastolic pressure. 

g. Standing diastolic pressure. 

h. Difference between horizontal and standing diastolic pressures. 

i. Horizontal pulse pressure. 

j. Difference between horizontal and standing pulse pressures. 

k. Breath-holding 20 seconds after exercise (as found in the 
McCurdy-Larson Test). 

l. Vital capacity. 


3. The implications of the factor analysis of the cardiovascular- 
respiratory variables and tests for test construction are: (a) the selection 
o; the items for a complete evaluation should include all factors; (b) if 
seme of the factors are.to be eliminated, this must be done in terms of a 
specific criterion, which represents the use to be made of the selected 
factors; and (c) the sum of a number of variables may be a better index 
of a specific ability, but it does not reflect the factors which are con- 
stituent parts 


4. The eight factors isolated in this research are not described by 
any one of the ten test “index scores.” The reasons for the lack of rela- 
tionship are: (1) the factor may not be included in the battery; (2) the 
number of items in the battery may be so numerous that the influence 
of the factor is not reflected; (3) the system of scaling the items in the 
test batteries makes the same items incomparable; and (4) the test battery 
may yield information not included in the eight factors. 
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The Relationships of External Fat To 
Physical Education Activities 
and Fitness Tests* 


By RAMON W. KIREILIs 
AND THOMAS K. CURETON 
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Urbana, Illinois 


(Submitted for publication, December, 1946) 


THE PROBLEM 


HIS study aims to show the relationship between fat as measured 
by calipers at six places on the exterior of the body and several 
aspects of physical education work: 


1. Relationships to certain structural physical tests, ie., ponderal 
index, calf girth, vital capacity residuals, biceps girth, and 
somatotype rating. 


to 


Relationships to certain functional organic efficiency tests, i.e., 
strength, strength per pound of body weight, expiratory force, 
flarimeter breath holding after chair stepping for 60”, and 
various pulse rate and blood pressure tests. 


3. Relationships to motor fitness performances on the Illinois 
14-Item Motor Fitness Test, the Cureton 18-Item Motor Fitness 
Test, and the Mile Run. 


I. SURVEY OF RELATED STUDIES 


A. Trends of Relationship Between Health and Fat on the Body.— 
The importance of obesity is amplified by the Metropolitan Life Insur- 
ance Co. upon the issuance of the following statement: 


Men who are 35 percent or more above average weight for their age and 
height have a mortality rate of 1V times that of the average man; also, when 
the abdominal girth is more than 2 inches greater than the chest girth at 
full expansion, the extra mortality rate is 50 percent above the excess 
mortality associated with the overweight itself. Diseases such as high 
blood pressure, albuminaria, glycocuria, cerebral hemorrhages, angina 


pectoris, and Bright's disease show unusually high indices with the obese 
individuals.22** 


*All of these tests are described and standard score rating tables are given in 


Cureton’s Physical Fitness Workbook, 1944 edition or 1947 edition published by C. V. 
Mosby Co., St. Louis. 


**All reference numbers hereafter used refer to the alphabetically arranged bibliog- 
raphy at the end of the paper. 
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Obese individuals usually have a high frequency of the following 
diseases: hypertension,? functional disease of the heart, kidneys and 
pancreas (diabetes) ,1® gallstones, heart lesions, coronary artery disease, 
lung disease, arthritis and exhausting bronchitis. !? 

Obese individuals not eating normal amounts of carbohydrates and 
eating abnormal amounts of fats and proteins are liable to a condition 
known as acidosis. These obese individuals cannot participate in long 
continuous physical activities because of the deficiency in carbohydrates 
and the excess of fat stored in the depots. This condition of acidosis 
causes headaches, nausea, and occasionally vomiting, weakness, sleepless- 
ness, muscular aches, acid mouth, sour stomach, acid urine, acid sweat, 
“sour disposition” and blood vessel disease which leads to other com- 
plications. ? 4 


B. Relationships Between Adipose Tissue and Motor Performances. 
The obese individual is inefficient in work production that is long 
and continuous. It is no wonder that an obese individual consuming a 
high fat diet is unable to participate in long continuous endurance 
activities.18 Excessive fat results in the loss of “tonus” of the abdominal 
wall which affects the blood supply, circulation, and the elimination of 
waste products!® and causes dilation of the heart and poor circulation 
of the blood.® The vital organs become surrounded with fat and are 
weakened.?1 The excess fat interferes with breathing, and often results 
in fallen arches, poor posture, a bone disease, and an increase in 
weight.22 Sweating is more profuse due to the insulating effects of 
adipose tissue. The obese individual becomes overheated.1® 

Every pound of fat requires 4,500 feet of blood vessels.1° The 
obese person is like an overloaded truck, usually in danger.2® A great 
deal of the blood goes to fat instead of the nerves, muscles, and vital 
organs.? The added weight in muscular exertion places a greater load 
upon the heart and circulatory system. 

The obese individual is usually awkward,?8 and the muscles become 
flabby from the lack of exercise.2® He is usually not agile and loses free 
movement in limbs and joints.18 He is seldom efficient. Weight is a 
reliable index of the state of a person’s physical fitness.2® The obese 
individual has a high frequency of accidents and minor sprains.?% 

Few studies have been made showing the direct relationship be- 
tween specific motor performances and adipose tissue. The use of calipers 
to measure the fat overcoat is a relatively recent development in physical 
fitness measurement. Grover’s!® findings (1936) were of some signifi- 
cance. The correlations he obtained between motor ability and adipose 
tissue are summarized as follows: The Rogers’ Physical Fitness Index 
correlated —.404 + .046 with adipose tissue and indicates that the 
physically fit individuals have less adipose tissue, based upon Rogers’ 
normative tables. The Cozens’ All-around Athletic Ability Test cor- 
related —.202 + .504 with the Grover-Cureton fat index. Slightly 
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better performance in athletics is obtained when less fat is on the body; 
other correlations obtained showed negligible relationships. These are: 
440-yard run correlated with adipose tissue —.113 + .067; McCurdy- 
Larson Organic Efficiency Test correlated with adipose tissue —.056 
+ .058; and 100-yard swimming endurance correlated —.064 + .094 
with adipose tissue, using Cureton’s 5-lap Drop-off Endurance Test. 

Studies made by Cureton (1941)® show that endomorphs and 
ectomorphs gave poor performances in comparison with mesomorphs in 
the Rogers’ Strength Index, sprinting, 440-yard run, broad jump, flexi- 
bility, Brace Test, and the Cozens’ Test. Hall (1942) found obese farm 
boys to be relatively weaker in leg-lifts on the high bar, chins, push-ups, 
and jumps 


Bruch (1940-41)4 found that the majority of obese individuals 
were inactive. Boothby (1922) found that the metabolic rates of 98 
percent of the obese men were within the range of + 15 and — 15; 
however, Lucas (1933)19 found that the obese individuals had a lower 
metabolic rate than the linear type of individual within this range. 
These authors suggest the great importance of the lack of exercise and 
a fattening diet as the principal causal factors as only 2-4 percent of 
all obese subjects can attribute the cause to metabolic disorder rather 
than to habits of overeating and inadequate exercise. 


Steinhaus has attempted to show the public how difficult it is to 
lose a pound of fat through exercise. Listed below are the tables of 
“human mileage.” The tables show the amount of exercise or work that 
is needed over and above that required for just lying still. These tables 
are approximate and assume that the individual is of average size and 
weighs 155 pounds. This type of an individual, in order to burn off a 
pound of fat, must perform the following amounts of work. The fol- 
lowing are direct quotations from Steinhaus?5: 


1. Run 129 one hundred-yard dashes at ten seconds each. 

2. Run 17.3 miles at a rate of a mile in six minutes. 

3. Play strenuous football for four hours and forty-eight minutes. 
4. Wrestle five and one-half hours. 

° 


Use an electrical vibrator machine for 375 fifteen-minute periods. 

6. Walk thirty-four and three-fourths’ miles, at a rate of five miles 
per hour. 

7. Fence for eight hours. 

8. Saw wood for ten and one-half hours. 

9. Do parallel bar work for eight and one-half hours. 

10. Play ping-pong for seventen and one-fourth hours. 

11. Walk sixty-six and one-half miles, at a rate of mile in seven- 
teen and one-half minutes. 

12. Drill for twenty and one-half hours. 

13. Walk 144 miles at a rate of two miles per hour. 
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14. Do ordinary office work for eleven eight hour days. 

15. Shovel 114,739 pounds of sand into a wheelbarrow. 

16. If he is a skilled mason, lay 14,731 bricks. 

17. Walk to the top of the Washington Monument forty-eight 
times. 

18. Do 5,714 push-ups from the floor. 


Generally, Steinhaus claims that it is unsafe for obese individuals 
to attempt to reduce through exercise. The appetite is stimulated through 
the participation in exercises, and the loss of weight is mainly due to a 
loss of water, which is regained later. He recommends that the best 
means to lose fat is through diet. 


C. Experimental Effect of Exercise Upon Adipose Tissue as Meas- 
ured by Calipers.—The utilization of fat through exercise has been shown 
by Anderson and Lusk (1917),! Rapport (1929),°% and Gemmill 
(1942).15 Also Gardiner (1925),14 and Courtice and Douglas (1929) ° 
have shown that the utilization of fat is higher when the intensity of the 
exercise increases. Courtice and Douglas (1929) and Dill (1935)?! 
found that the burning of carbohydrates is more efficient than protein 
and fat during exercise. The studies of Hotzbaur (1927)17 and 
McCurdy (1939)2° indicate that weight can be lost through exercise. 
However Tuttle (1943)27 advocates diet and exercise. Tuttle found 
that loss of up to 5 percent of the body weight does not have any detri- 
mental effect on the body. 

No fat was actually measured in these studies. No other studies 
were found in searching the literature which measured the reduction of 
adipose tissue concomitantly with the practice of specific exercises. 


Il. MEASUREMENT AND STATISTICAL RELATIONSHIPS OF FAT 
TO PHYSICAL PERFORMANCES 


At the University of Illinois measurements of external fat were 
made on students in various physical education classes and these same 
students were given a large number of physical fitness tests. The results 
were correlated by the Pearon scattergram statistical procedure to de- 
termine the relationships of fat to these performances. 


Techniques of Measurement for External Fat—The Franzen type 
calipers! were used for measurement of the external fat. The calipers 
used had 200 gms. of force at 20 mm. and 400 gms. of force at 30 mm. 
of opening between the jaws. In making the measurements, the calipers 
were held in the right hand. The volar surface of thumb and index finger 
of the left hand pinched up the adipose tissue to free it from the muscles. 
Then the measurements were made by placing the jaws of the caliper 
blades below, parallel and in line with the tips of the thumb and fore- 
finger. The pressure of the spring of the calipers held the tissue firmly 
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and the indicator on the scale gave a measure of the thickness of the 
adipose tissue in millimeters. When the tissues were very firm and the 
calipers began to slip, the blades of the calipers were then pressed inward 
just enough to keep the tips of the blades in line with the left thumb 
and forefinger.13-7 Therefore, the absolute measurements include a 
double layer of adipose tissue and skin. The skin thickness was neglected. 

The specific techniques of measuring adipose tissue were as follows. 
The technique used was identical with the technique used by George 
H Grover.!6* 

Cheeks: Measurment was made on the right cheek, level with 
the lower part of the nose, at the center of the cheek. 

Abdomen: The measurement was made two inches to the left of 
the navel 

Hips: Measurement was made on the right hip at a point located 
at the center of the sagittal plane of the body, and even with the top 
of the crest of the illium. 

Gluteal: Measurement was made on the left buttocks, level and a 
little above the gluteal fold. 

Rear Thigh: Measurement was made at the center of the rear 
thigh, half-way between the knee and cleft of the gluteal fold with the 
foot relaxed and knee bent. 

Front Thigh: Measurement was made at the center of the front 
thigh half-way between the knee and the groin with the foot relaxed and 
off the ground. 

Cureton (1941) made a normative study on adipose tissue of young 
men using the Grover-Cureton Weighted Adipose Tissue Index, de- 
veloped at Springfield College, on young athletic subjects. Rating tables 
for total weighted adipose tissue are provided but in a further study at 
Illinois University (1944) tables were provided for fat on the cheeks, 
abdomen, hips, front thigh, rear thigh, and the total based on direct meas- 
urement of external fat of 332 cases of young men at the University of 
Illinois. 


Reliabilities of the Measurements.—Grover and Cureton have found 
high reliabilities for the adipose tissue measurements. The reliabilities 
for the six measurements are as follows: 


Cheeks .862 + .028 Gluteals .968 + .006 
Abdomen 975 + .005 Front Thigh .968 + .006 
Hips 901 + .019 Rear Thigh .944 + .001 


The reliability for the six measurements summed was found by 
Cureton to be .90. Grover did not give the composite reliability. These 
reliabilities as a whole are satisfactory. 

A. Statistical Findings, 1942 Data Showing Relationships to Vari- 
ous Physical Tests—Correlations were computed between the Adipose 


*Cf. Cureton, Physical Fitness Workbook, p. 110, for illustration. 
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Tissue Index (Sum of 6 measurements on cheeks, abdomen, waist, front 
thigh, rear thigh, gluteal fold in mm.) and 23 various physical perform- 
ances, using 113 young male subjects in the basic physical fitness classes 
at the University of Illinois. The results are as follows: 


TABLE I 


CoRRELATIONS BETWEEN ADIPOSE TISSUE AND MOTOR PERFORMANCE 
(Young Men, Basic Physical Fitness Course, PEM 60, University of Illinois) 








Motor Skills r r* P.E. rll 
(true) 
Composite Running Index** —.492 —.578 +.047 +.80 
Mile Run —.399 458 +.033 +-.84 
Chins —.397 427 +.032 +-.95 
1000-Yard Run $98 —.436 +.037 +-.86 
100-Yard Dash —.337 399 +.036 +.79 
2-Mile Run —.336 382 +.036 +.80 
Broad Jump —.326 —.360 +.038 +.90 
314-Mile Run —.287 33 +.042 +.80 
Fence Vault —.286 307 +.048 +.95 
Stick Body —.210 —.264 +.045 +.71 
Frog Stand —.167 +.041 +.92 
ops a -.146 +.040 +.82 
Dive and Roll oe, 117 +.036 +-.95 
V-Sit \—.112 +.038 +.80 
Sit-Ups ..072 +.038 +.83 
Trunk Extension Flexibility —.066 +.058 +.94 
Man Lift —.066 +.036 + 87 
Leg Lifts —.056 +.037 +.73 
Medicine Ball Put —.009 +.037 +.82 
Diver’s Stance —.004 +.048 
Schneider Index —.010 +.045 +.86 
Trunk Flexion —.044 +.061 +.95 


Extension Press‘Ups _ —.053 207s —S 


*Corrected for attenuation. Cf. H. E. Garrett, p. 213, Statistics in Psychology 
and Education, 1933: 
V AiBy 
VAB = — ——_"__"—_—- 
VAyAo VB,Be 


**Composed of the sum of the standard scores for mile, 2-mile, 1000-yard 
drop-off and 3!/4y-mile steeplechase runs. The other events are described in pre- 
viously published sources.” 


B. Statistical Findings, 1945 Data.—-Correlations of the Adipose 
Tissue Index with 23 other physical tests and measurements gives evi- 
dence to show how external fat is related to various aspects of physical 
fitness, including physique, organic reactions to exercise, and motor per- 
formances. Cureton obtained the following correlations on 110 Univer- 
sity of Illinois freshmen in the medical school (18 to 25 years of age): 











ee 








EXTERNAL FAT TO PHYSICAL EDUCATION 129 


TABLE II 


CORRELATIONS BETWEEN ADIPOSE TISSUE AND 
VaRIOUS PHYSICAL FITNEss TESTS 


(N = 113) 
r P.E. 

Adipose Index with the Ponderal Index +.676 +.035 
Adipose Index with Calf Girth +.627 +.037 
Adipose Index with Vital Capacity Residuals —.525 +.047 
Adipose Index with Biceps Girth +.542 +.046 
Adipose Index with Strength/lbs. of Weight —.406 +.053 
Adipose Index with Skeletal Index +.399 +.054 
Adipose Index with Illinois 14-Item Motor Fitness Test —.345 +.056 
Adipose Index with Composite of the Standard Scores for 

the Mile Run, 14 and 18-Item Motor Fitness Tests —.340 +.056 
Adipose Index with Criterion of 22 Tests 

(Sum of Standard Scores) —.218 +.060 
Adipose Index with Expiratory Force on U-Tube —.203 +.061 
Adipose Index with Mile Run —.202 +.061 


The results in Table III show fairly high negative correlations be- 
tween the external fat as measured on six different places on the body 
and the composite endurance running criterion composed of the sum 
of the four standard scores for the mile, 2-mile, 3/z-mile steeplechase, 
and the 1000-yard drop-off index. This means that less fat goes with 
better running. 


TABLE III 


CoRRELATIONS BETWEEN COMPOSITE RUNNING INDEX AND THE FAT OF THE 
CHEEKS, ABDOMEN, Hips, GLUTEAL FOLD, REAR THIGH, AND FRONT THIGH 


T 

Location of Measurement T (true) P.E. 

Abdomen —.640 —.737 +.037 
Gluteal —.498 —.588 +.049 
Front Thigh —.394 —.424 +.059 
Hips —.381 —.409 +.059 
Rear Thigh —.374 —.425 +.059 
Cheeks —.262 —.262 +.060 


Ill. EFFECT OF STRENUOUS TREADMILL RUNNING ON 
EXTERNAL FAT 


Three relatively fat students were selected from a class in physical 
eJucation at the University of Illinois in the summer of 1944. They 
were of the endo-mesomorphic type and are shown in Illustration I. 
The body weight and the six adipose measurements were made at the 
start of the experimental work and at the start and end of every day 
of the experiment thereafter. Measurements were also made at the start 
of each period of the Schneider Index. The subjects ran several times 
each period, doing all they could do on each trial. 














Illustration I. 


Top: 


RESEARCH QUARTERLY 


The experimental subjects; Bottom: 
exercising on the treadmill. 


The method 








of 














131 


DUCATION 


4 


E 


TO PHYSICAL 


FAT 


RNAL 


EXTE 














RESEARCH QUARTERLY 


Chart I shows the results of six weeks of strenuous running. There 
is a definite trend for the external fat to diminish as the miles ran 
accumulated from week to week. There was not much weight loss, 
which suggests that the men added muscle density as the fat reduced. 
The temperature and relative humidity were recorded each period but 
were not controlled as there were not facilities for such atmospheric 
control. 

The fat loss at each place measured on the body over the whole 
six-week period is shown in Table IV. 


TABLE IV. 
EXTERNAL Fat Losses AT EACH PLACE ON THE Bopy 
(N = 3) 
Per Cent 
Differences 
Location Start End Differences (mm.) C.R. 
Gluteals 37 27 10 27.1% 1.228 
Rear Thigh 39 28 11 28.2% 1.307 
Front Thigh 39 34 5 7.8% 742 
Cheeks 25 22 3 12.0% .867 
Abdomen 37 31 6 16.2% .742 
Hips 37 25 12 32.4% 1.462 


It appears that relatively more fat was lost from the hips, rear 
thigh, and buttocks in this up-hill running than from the cheeks, front 
thigh, and abdomen. However, since the critical ratios for the differences 
between the means are all lower than 3, these differences cannot be 
guaranteed as to statistical significance. 


CONCLUSIONS 


1. Previous studies on body fat have not related measurements of 
external fat by the caliper method to various amounts of exercise or 
types of class work in physical education. 


2. There are significant negative correlations of the magnitude of 
—.578 to —.264 between performances of strenuous physical exercises 
and external fat on the body, the correlations with endurance running 
being relatively greater than for other test exercises. Fat is a real handi- 
cap in most strenuous exercises. 


3. Massive bodies (by the ponderal index) tend to have greater 
amounts of external fat and the larger calves and biceps are usually 
larger because there is more external fat present. 


4. Cardiovascular and respiratory measurements do not seem sig- 
nificantly related to external fat within the range of the group data 
considering the low correlations obtained (—.203 to -+-.122) but in the 
fat reduction experiment using three subjects in treadmill running there 
was a progressive gain in the Schneider Index as the training proceeded. 
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5. In strenuous endurance running the fat on the abdomen and 
buttocks seems to be more of a handicap (with negative correlations 
with performances, —.737 to —.588) than fat on the thighs and cheeks 
(—.394 to —.226). 


6. Strenuous treadmill running in August and September caused 
an appreciable loss of external fat over six weeks of training in 30-minute 
periods, three times per week. The loss was proportional to the miles 
cumulatively run, considered as the total distance added for a series 
of short runs in each period, summed over all the periods and converted 
to miles 


8. Weight is not a good guide to the fat loss, possibly because fat 
loss is compensated for by increased muscular density due to the exercise. 
The fat loss was at a greater rate than the weight loss in this experiment. 

9. The average Schneider Test scores began at 5 for the first period 
and finished at 11 on the last period of the sixth week of training, a 
gain of 120 percent from the starting level in raw scores, equivalent to 
a gain from 30 to 50 on the standard score tables,* a standard score 
gain of 20 percent on the 100-point scale based on a normal distribution. 
In all but the 4th week of training the Schneider Index was regularly 
lower on Monday mornings after the week-end and improved during 
the week with treadmill running every other day. 
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The Sexual Maturing of Girls As Related 
To Growth in Strength 
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HE association of sexual maturing with specific changes in growth 

rates has been clearly shown by Shuttleworth®* in a study of 

the physical growth of girls classified according to their age at 
menarche. A similar relationship between sexual maturing and func- 
tional traits (e.g., static dynamometric strength) may be less easy to 
demonstrate because differential factors associated with incentive, fit- 
ness, and physical activity may tend to obscure basic growth phenomena. 
determined, in a normal sample, by intrinsic rather than extrinsic factors, 
we would expect growth curves in different menarcheal groups to reflect 
differences similar to those already demonstrated for physical structures. 

Prior to the analysis of age changes in strength, it may be useful 
to compare strength data for pre-menarcheal and post-menarcheal girls 
or the same chronological ages. This method has been employed, in cross- 
sectional studies, to examine the influence of physiological maturing 
upon various characteristics. It has not always rested upon adequately 
reliable data, for the age at menarche is sometimes based on retrospective 
reports, obtained from several months to several years after the event. 
In the present investigation longitudinal records have been used, collected 
in connection with a study of adolescent growth. Table I presents a 
comparison of the average strength of grip** for groups known to be 
pre- or post-menarcheal. At each of the three age levels marked differ- 
ences occur; these differences are significant at the .1 percent level of 
cenfidence 

Table II presents similar data, in somewhat different age classifica- 
tions, for a “Total Strength” composite, consisting of Right Grip, Left 
Grip, Pulling Strength, and Thrusting Strength.*** At ages 12.25 to 
13.25, the differences between the two groups are about 1 §.D.; more 
than 80 percent of the post-menarcheal girls excel the mean for the less 
mature girls of the same age. After 13.25 the difference becomes smaller, 


The material in this report is presented in greater detail as a chapter in a forth- 
coming book, Motor Performance and Growth. ® 

*Superior figures refer to numbered bibliography at end of article. 

**Streneth of Grip was tested for right and left hand separately, using the best of 
three records for each hand. 

***The technique of measurement is presented in reference No. 6. The data of 


Table II are adapted from a master’s thesis by Caldwell based on the girls of the present 
sample. The class interval in Caldwell’s study is from 12.0 to 12-49 years, ete. 
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as a result of continued growth among the less mature and the reduction 
or cessation of growth among those well past the menarche. 


TABLE I 
COMPARATIVE STANDARD Scores* (RIGHT Grip) For Girts oF DIFFERING 
PHYSIOLOGICAL STATUS 


Pre-menarcheal Post-menarcheal 
Age** N Mean S.D. N Mean S.D. t 
(kg.) (kg.) 
12.5 56 44.5 9.4 23 54.1 8.4 4.2 
13.0 46 43.8 10.0 38 50.7 8.7 3.3 
13.5 26 38.7 8.9 50 49.0 8.2 5.0 


*Raw scores at each age have been transformed into scale scores, relative 
to a mean of 50 for the total group at that age, and with an S.D. of 10. 
**The class interval is 12.25 - 12.74, etc. 


TABLE II 
A COMPARISON OF PRE- AND POST-MENARCHEAL CASES IN TOTAL STRENGTH 
Pre-menarcheal Post-menarcheal 
Age N Mean &.D. N Mean S.D. t 
(kg.) (kg.) 
12.25 63 86.0 16.30 15 100.6 15.55 3.11 
12.75 44 92.0 16.35 24 105.6 15.30 3.31 
13.25 25 93.0 16.20 37 108.8 16.35 3.68 
13.75 20 97.1 16.35 54 105.6 16.50 2.00 
14.25 17 100.7 19.50 63 107.9 18.00 1.42 


Tables I and II demonstrate that among girls of the same chron- 
ological age, those who are past the menarche are stronger than those 
who have not reached the menarche. We may interpret this as due to 
the fact that the former group have had the full effect of the pre- 
menarcheal growth spurt. 

The anatomical basis for the relationship between strength and 
sexual maturity is to be seen in Reynolds’? recent report concerning 
the comparative growth of tissues as measured from x-rays; in the growth 
curves which he shows for early- and late-maturing girls (classified by 
breast development), curves for breadth of muscle begin to diverge, 
for the two groups, as early as 8 years. In the prepuberal years the 
early- and late-maturing appear to be more sharply differentiated by 
growth of muscle than by growth of bone or fatty tissue. 

As a further approach to a study of the factors represented in 
Tables I and II, correlations were determined between Strength of Grip 
and (1) chronological age and (2) age deviation from menarche. These 
correlations were based on successive testings, at half-year intervals, of 
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the children at three school grade levels. The results, in terms of Pearson 
r, are given in Table III. 


TABLE Ill 


THE CORRELATION OF STRENGTH WITH CHRONOLOGICAL AND 
PHYSIOLOGICAL AGE 


Strength of Gri 


Average Strength of Right Correlated wit 

Age N Grip Correlated N Age Deviation 
with C. A. from Menarche 

11.5 92 .196 79 .429 

12.0 104 .201 90 385 

2.9 99 .193 90 417 


Within the restricted age range of each grade the table shows a 
moderate correlation (significant at the .1 percent level) between 
strength and maturity expressed in terms of the number of months 
from menarche.* A _ consistently lower correlation occurs between 
strength and chronological age. At any one age the difference between 
coefficients is significant at only the 5 percent level, but the stability of 
results in three successive testings deserves to be noted. It should not 
be surprising to find that strength depends more on individual maturity 
than on the calendar of years from birth; evidence of this nature, how- 
ever, fails to show the precise temporal relationships between sexual 
maturing and the growth curve for strength. 

At this point, we may utilize the more characteristic advantages 
of longitudinal data by examining age changes in terms of cumulative 
growth curves. Three groups of girls were selected from the Adolescent 
Growth Study #4 ° with reference to their age at menarche. Repre- 
senting the two extremes and the middle of the normal sample, these 
nay be considered to be early-maturing, average, and late-maturing. 
The early group included only cases in which the menarche occurred 
before the age of 12 years; in the average group it occurred between 
12.5 and 13.5, and in the late group after 14. The mean ages at menarche 
for the three groups were, respectively, 11.7, 13.2, and 14.8. 

It may be noted in Table IV and Figure 1 that early-maturing 
girls have a more d’stinct growth pattern than is true of the other two 
groups; their growth is extremely rapid between 11 and 13 years, and 
slow thereafter. The late-maturing, on the other hand, grow at a more 
uniform rate, with continued gains at an age when the early-maturing 
are gaining very little, if at all. At the age of 13 the differences between 
the two contrasting groups are significant at the 1 percent level,** but 


*Pre-menarcheal cases are plotted with a minus age deviation from menarche; post- 
menarcheal cases, with a positive age deviation. 
$+ 2.75, with 30 d.f. S.D.’s are 4.6 and 4.9 for the early and late groups respectively. 
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4 


TABLE IV 


MEAN Scores (KG., RIGHT GRIP) FOR EARLY-, AVERAGE’, AND 
LATE-MATURING GIRLS 


Age* Early Average Late 

N: 16 24 16 
11.0 21.1 20.9 20.6 
11.5 24.4 23.2 21.2 
12.0 26.1 25.8 24.5 
42.3 29.1 26.8 23.7 
13.0 30.3 28.8 25.7 
13.5 29.3 30.3 26.8 
14.0 29.7 30.7 26.4 
14.5 31.0 32.2 28.4 
15.0 32.5 33.3 31.4 
3.5 33.4 35.2 32.7 
16.0 33.4 35.8 32.4 
16.5 34.7 36.1 34.4 
17.0 34.3 36.5 34.8 
17.5 33.9 37.8 35.3 


*The class interval is 10.75 to 11.24, etc. 


at 14 and beyond this is no longer true. 


The intense velocity of growth in the early-maturing, followed 
by a prompt leveling off, is paralleled by growth data on height and 
other physical dimensions,* and suggests that adolescence is a somewhat 
different experience for those who mature at different rates. It is prob- 
able that the physiological imbalances characteristic of a period of growth 
are more acute among the early- than the late-maturing, but with the 
latter these imbalances are, so to speak, chronic over a relatively long 
period. The effect upon the individual may vary widely from case to 
case, but in view of the interrelationship of different aspects of growth, 
teachers and counsellors should not neglect the possible role of precocious 
or delayed maturing, in connection with problems of adolescent ad- 
justment. 

After 13 the scores for the early group tend to drop below the 
average, although not to a statistically significant degree. Such a drop, 
however, is not unexpected in view of previous studies of growth in 
physical size. In the same sample, Bayley has found that whereas early- 
maturing boys are still above average in height at age 17, early-maturing 
girls fall lower than the average at this age.2»?- 55 Similarly, Shuttle- 
worth has reported that girls with the earliest ages of maximum growth 
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Fic. 1. Growth Curves (Right Grip) for Early-, Average-, and Late- 
maturing Girls, Classified on the Basis of Age at Menarche. 





are the tallest in the Harvard Growth Study up to about 13 years, but 
are the shortest at 17.9 P- 11 Physical precocity in girls appears, then, 
to be associated with an early arrest in growth; the same is not true 
of poys. 


It is apparent that late-maturing girls lag behind the average at 
all ages. The difference is small, but its consistency may support the 
hypothesis that among girls (as among boys) the constitutional factor 
of ectomorphy, with relatively weak muscular development, is associated 
with retarded growth.® 

In spite of the fact that growth curves in strength among the early- 
and late-maturing are similar to those in physical dimensions, the 
question may be raised as to whether the awareness of maturing, in 
post-menarcheal cases, may have a sharply inhibitory effect upon motor 
performance, perhaps by way of altered motivation. A careful study of 
individual records fails to reveal evidence of this, although it is probably 
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true that in later adolescence girls are, in general, less readily motivated 
than boys in physical activity tests. 


An independent determination of rate of maturing is available from 
x-rays of the hand and knee, which were taken semi-annually and 
assessed by Dr. Nancy Bayley with reference to the Todd skeletal age 
norms.* From these cumulative records, two contrasting groups of early- 
and late-maturing girls were selected. As in the case of the groups 
chosen by the menarcheal age criterion, they do not consist of clinical 
extremes, but are merely arbitrary truncations of a normal distribution 
of school children. The early-maturing represent approximately the 
earliest 20 percent in a normal public school population; the late-matur- 
ing include approximately the 20 percent slowest to mature. 
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Fic. 2. Distributions, at Two Ages, of Total Strength Scores for Early- 
and Late-maturing Girls, Classified on the Basis of Skeletal X-Rays. 


Figure 2 presents distributions of total strength (Right and Left 
Grip, Pull, Thrust) for 16 early- and 16 late-maturing girls (by the 
criterion of skeletal age). Each dot or circle represents an individual, 
with the mean of each distribution indicated by an arrow. In the upper 
half of the figure, for age 12.5 years, the difference in distributions is 
so marked that none of the late-maturing girls reach the mean of the 
early-maturing. In the lower half of the figure, at age 15.5, this differ- 





*The technique of x-ray assessments, in this study, is described in reference No. 2 
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ence has disappeared, and the early- and late-maturing have a closely 
similar spread of scores. 

A comparison of Figures 1 and 2 leads to similar inferences as to 
the strength characteristics of different maturity groups. The contrasting 
samples of cases reach about the same strength level after the age of 
15, but by very different routes. 

An additional method of comparing age changes is through the 
use of standard scores. Relative strength levels, and changes in these 
levels, are brought out more clearly when strength measures in physical 
units are transformed into scale values (relative to a group mean of 
50 and a group S.D. of 10). These computations have been made for 
the early and late groups, and the results are shown in Table V and 
Figure 3. Figure 3 provides graphic evidence of an increasing relative 
growth rate of the early-maturing girls from 11 to 13, followed by a 
terminal slowing-up.** The late-maturing girls exhibit a growth phase 
which is not shown for the early-maturing, because our data begin at 
too late an age to include it; this is the recession phase preceding the 
puberal growth spurt, a characteristic slowing in the rate of growth, 
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_. Fic. 3. Standard Score Curves (Right Grip) for Early-and Late-maturing 
Girls, Classified on the Basis of Age at Menarche. 








**A rise in a standard score curve indicates an increasing rate of growth, relative 
to the total sample; a fall in the curve indicates slower-than-average growth. 
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which has been noted also in pre-puberal growth curves for physical 
dimensions.8 The recession phase is followed by an increasing relative 
growth rate after 14. Among both early- and late-maturing the growth 
pattern has a definite relation to the menarche, which marks the approxi- 
mate termination of the puberal growth spurt in strength. 


TABLE V 
MEAN STANDARD SCORES FOR EARLY- AND LATE-MATURING GIRLS (RIGHT GRIP) 
Age* Early Late 
N: 16 16 
11.0 50.4 48.9 
12.0 54.2 47.0 
13.0 54.9 46.1 
14.0 7t.2 44.7 
15.0 49.8 48.5 
16.0 49.8 47.6 
17.0 48.9 49.3 


*The class interval covers half a year, 10.75 to 11.24, ete. 


SUMMARY 


The effect of physiological maturing upon growth in static dyna- 
mometric strength was examined by several methods in a normal sample 
of girls in a public school system. 


1. Post-menarcheal girls are stronger than pre-menarcheal girls of 
the same age. 


2. As shown by correlations, strength appears to be more closely 
related to physiological age (age deviation from the menarche) than to 
chronological age. 


3. Growth curves in strength for groups classified as early, average, 
and late, in the age at reaching menarche, show that the late-maturing 
leg behind the average at all ages; the early-maturing have a tendency 
toward more rapid growth from 11 to 13, but their growth is retarded 
relatively soon and they reach a terminal position below the group mean. 
These results tend to support the hypothesis that endocrine factors 
which promote early puberal growth in girls also lead to an early arrest 
of growth. 


4. Comparisons of distributions for early- and late-maturing (classi- 
fied on the basis of skeletal age) show conspicuous differences at age 
12.5, which is close to the time of maximum superiority of the early- 
maturing. With a continued slow but persistent growth of the late- 
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maturing, these differences disappear in the course of the next three 
years. 

5. As shown in standard score curves, the pre-puberal growth spurt 
in strength begins about one year before the menarche, and reaches a 
peak close to the time of menarche. 

6. The early- and late-maturing reach similar strength levels in 
later adolescence, but through different patterns of growth. 

Since the physiological factors which underly these contrasting 
giowth patterns may be expected to find expression in many other aspects 
of adolescent development, it is a reasonable inference that measurement 
and guidance in physical education should be concerned not merely 
with individual differences in status at a given time, but also with indi- 
vidual differences in the process of maturing. 
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The History of School Health Education 
In West Virginia, 1863-1945 


By GLApyce H. BRADLEY 
Morgan State College 
Baltimore, Maryland 


(Submitted for publication, January, 1947) 


Ai ITHOUT history,” says Juan Luis Vives, the Spanish scholar 
and writer, “no one would know anything about his father 
and ancestors, no one would know his own rights or those 

of another.” It is necessary to know the history of school health educa- 
tion to understand and appreciate the increasing importance of child 
health, the efforts that have been made by various groups and individuals 
to protect the source of power of the nation, and the vast amount that 
remains to be done in the field of school health education. 

This study is concerned with the development of school health edu- 
cation in West Virginia, beginning with 1863, the year of the state’s 
formation and ending with 1945. By separating the early days of the 
state’s existence, those of World Wars I and II, the time of the intro 
duction of significant trends, developments or changes in health concepts, 
the time span of eighty-two years has been divided into six periods 
around which the data are presented. Sources of data used include the 
reports of the state suprintendent of schools, State Department of Edu- 
cation, public health laws, school laws, code of West Virginia, and 
additional sources which are included in the bibliography. These reports 
are authoritative in that they were prepared by responsible officers of 
the state, and complete in that they cover the entire period of educational 
and health activities. 


Period of the Nineteenth Century, 1863-99-—When West Vir- 
ginia was separated from Virginia on June 20, 1863, it was recognized 
that a school system was desirable. At this time there was little interest 
in the nation as a whole concerning health education. The increased 
concern about schoolhouse architecture was not based fundamentally 
on health improvement. Courses in physiology and hygiene, which were 
offered in some schools, dealt chiefly with factual information. In the 
vast majority of schools little attention, if any, was paid to these subjects, 
probably because the authorities were principally interested in providing 
instruction in the fundamentals of reading, writing, and arithmetic. Until 
these had been provided it was practically useless to think in terms of 
broadening the curriculum. But as time went on and the general con- 


A dissertation submitted in partial fulfillment of the requirements for the degree of 
Doctor of Philosophy at the University of Michigan, April, 1946. 
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ditions of the school system improved, certain factors developed which 
led to more interest and emphasis on school health. 

The establishment of the State Board of Health on March 3, 
1881,1 perhaps did more than any other factor to develop interest in 
school health education in West Virginia. One of the duties of this 
Board was to make sanitary investigations. In compliance with this 
regulation a complete survey of the sanitary conditions of the schools 
was made in 1884. Many deplorable conditions were found. Recommen- 
dations were made for the improvements of the environmental condi- 


> 


tions of schools.* 


Advances made in school health education during the nineteenth 
century were largely due to the broad vision of B. L. Butcher, state 
superintendent of schools, and to the general interest of the State Board 
of Health in more hygienic school surroundings. The initial health in- 
struction introduced in schools was instruction on the effects of alcoholic 
drinks, stimulants, and narcotics on the human body. As early as 1885, 
State Superintendent of Schools Morgan urged that such instruction 
be given in all schools. This was thought to be necessary because of the 
widespread evil of intemperance, and the ignorance of the population 
about the effects of alcoholic drinks and narcotics on the body.4 A 
similar opinion was shared by the State Board of Health. As a result 
of this interest and the activity of the Women’s Christian Temperance 
Union, begun in 1879, the Legislature in 1887 passed an act governing 
instruction in alcoholic drinks and narcotics.° This law, like those passed 
by forty other states at about the same time, specified that the instruc- 
tion should be a part of a broader program of instruction in physiology 
and hygiene. The passage of this law marked the beginning of the 
teaching of hygiene on a broad scale. It is interesting to note that as 
early as 1842 Horace Mann had pointed out the need for such in- 
struction. 


In 1878 the Legislature provided that Cutter’s Anatomy and Physi- 
ology was to be used exclusively.6 The three books to be used were 
Cutter’s Beginner's Anatomy, Physiology, and Hygiene, Cutter’s Inter- 
mediate Physiology and Hygiene, and Cutter’s Comprehensive Physi- 
ology and Hygiene. 


Instruction dealt chiefly with factual information. This is shown 
by the following set of questions for the third grade’: 


Apparatus: Blackboard and chart 
Subject: Our bodies 
Divisions of body: Head, trunk, and limbs. 


1 Re port of the State Board of Health, 1881-2, p. 7. 

2 Ibid., p. 21. 

3 Biennial Report of the State Superintendent of Schools, 1898-1900, p. 10. 

4 Ibid., 1885-7, p. 18. 

5 Code of the State of West Virginia, 1887, Chap. 45, Sec. lla. 

6 Ibid., 1894, Chap. 45, Art. 58. 

7 Biennial Report of the State Superintendent of Schools, 1889-90, pp. 54-5. 
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The Head 
The head is a bony box called the skull. 
The head contains the brain. 
The brain is the seat of thought. 
Show how alcohol and tobacco affect the brain. 


>awnwre 


The Trunk 


The trunk is divided into chest and stomach. 

Explain position and office of heart and lungs. 

Show effects of alcohol and tobacco on the heart, blood, and breath. 
Explain the position and office of stomach. 


Yee 


The Limbs 

1. Explain the use of bones, and joints, and muscles. 

2. Show how the joints may become stiffened by allowing the 

extremities to remain cold for a length of time. 

3. Show the effects of alcohol and tobacco on the limbs in unsteady 

gait and trembling of the hands. 

Period Preceding World War I, 1900-13.—During this period 
school and health authorities continued to be concerned with the im- 
provement of environmental conditions pertaining to health.§ Likewise, 
there was keen interest in medical inspection of school buildings. The 
request by the state superintendent of schools, Morris P. Shawkey, for 
architectural standards for school buildings made known the desire of 
the state for an improved building program.® 


Another aspect of school health education which was introduced 
and emphasized at this time was that of medical inspection of school 
children. The first function of the public schools in the field of health 
was protection. When laws were passed demanding that every child 
attend school, it became incumbent upon the state to see that the child’s 
health was not impaired in meeting this requirement. Later it was recog- 
nized that there were conditions in the schools that offered a menace 
quite apart from sanitation, namely, those menaces which were chiefly 
in the form of communicable diseases, such as smallpox, chicken pox, 
measles, diphtheria, typhoid fever, and whooping cough. School authori- 
ties noticed that during the fall especially there was an epidemic of 
communicable diseases, directly following the opening of public schools. 
Communicable disease control, therefore, was the first function of early 
school physicians and health officers in schools. 

Physicians in general, however, soon observed that some children 
who were not ill from communicable diseases needed medical attention. 
They realized that if the health of the child were to be protected it 
was necessary to do more than be sure that he was not ill from a com- 
municable disease, nor made ill from the effects of an unsanitary en- 
vironment. 

It became necessary to establish a program commensurate with the 


8 Ibid., 1902-04, p. 58. 
9 Loe. cit. 
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child’s needs. The first part of this program was medical inspection. The 
legislation of 1911,1° which required the boards of education of inde- 
pendent districts to provide medical inspectors and was permissive in 
other districts, still remains unchanged. That the law was permissive 1n 
some districts has been a great deterrent to the school health program. 

Although health instruction was better organized during this period 
than the preceding one, the subjects and their content remained the 
same. However, there was a pronounced irregularity in the enrollments 
of courses in physiology, hygiene, and sanitation.11 This irregularity 
probably was influenced by the general science movement, which began 
about 1900, and by the developing interest in home economics. 


Period of World War I, 1913-19.—This international crisis focused 
the attention of the state and nation on the importance of health. Health 
improvement was the keynote of the period. The numerous rules and 
regulations which were passed were directed toward this end. In addi- 
tion, the activities of the West Virginia Anti-Tuberculosis League,!? 
the survey of the State Health Department by Doctor Taliaferro Clark 
of the United States Public Health Service,1# and the establishment of 
the McDowell County Dental Clinic!4 were similarly concerned with 
the improvement of personal health. Furthermore, the factual material 
of health courses was gradually supplemented by applied health prin- 
ciples. Finally, physical education was recognized as a fundamental part 
of the school curriculum. 


From the findings of medical inspection and physical examinations 
it became obvious that it was necessary to do something to correct the 
defects observed. With this extension of the health concept the second 
phase of the school health education program evolved, the correction of 
physical defects. Parents were notified of their children’s defects and 
urged to secure the necessary corrections. School nurses were provided 
by boards of education and health departments to participate in these 
activities. 

Nurses were considered particularly necessary in the follow-up 
program. The scope of this program has been expanded to include 
many important activities. It was found that some families did not 
have money to pay for corrections and immunizations. Therefore, it 
became necessary to develop free clinics. These were financed by the 
State Health Department, boards of education, Parent-Teacher Associa- 
tions, Koppers Coal Company, and other agencies and organizations. 

Period Following World War I, 1920-29.—The modern program of 


10 Acts of the Legislature, 1911, Chap. 31, Code of the State of West Virginia, 1919, 
Chap 2, Art. 64 

11 Report of State Supervisor of High Schools, 1915-16, p. 17. 

12S. Adolphus Knopf. A History of the National Tuberculosis Association. New 
York: National Tuberculosis Association, 1922, p. 22. 

13 Public Health Reports, XXX, p. 238. 

14 Biennial Report of the State Superintendent of Schools, 1919-20, p. 81. 
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school health education has developed since 1917. Much of the credit for 
its initiation, promotion, and development in the United States and West 
Virginia should be given to the Child Health Organization of America 
(1917-29). Other private agencies, including the American Child Health 
Association, the National Tuberculosis Association, and the National 
Education Association together with governmental agencies, colleges, and 
universities, provided health leadership. This was evidenced in printed 
materials, the Health Crusade, and teacher education in health. 

Health was stressed as an essential for education by the Report of 
the Commission on the Reorganization of Secondary Education, which 
in 1918 outlined the Seven Cardinal Principles.‘Health was listed as the 
first principle. Recognition was made also of the relationship of health 
to some of the other principles. Thus, the idea that the school’s only 
responsibility was to teach facts was supplanted by the idea that the 
school’s responsibility was to train for citizenship, with due considera- 
tion for the child’s physical, mental, social, and emotional health and 
welfare. 


Until the period of World War I, physical education in elementary 
schools had been limited to calisthenics and exercise. Many secondary 
schools did not offer physical education courses; others had mass athletic 
programs, football and basketball. During the period, 1920-29, as a 
result of the health deficiencies found in the draft examinations for 
World War I, a demand for health and physical education appeared. 
A statewide program was introduced under the supervision of Melville 
K. Stewart, who later was succeeded by Arthur G. Upton. The program 
was greatly aided by the publication of state manuals on health and 
physical education. 


The outstanding difficulty of the new movement in health educa- 
tion in the twenties was an almost entire lack of anything approaching 
acceptable standards of procedure. Various specialists, in addition to 
regular school personnel, were working toward their own objectives, as 
if they had such objectives entirely independent of other workers in 
the field. Gradually, however, the program became better organized 
and coordinated as efforts were made to plan the program in terms of 
the needs of the individual child. This was not true of health education 
alone but of the entire school curriculum. 


Period from 1930-39.—Increased use was made of successful ex- 
‘ periences of the preceding period, such as medical inspection of school 
buildings, medical inspection of school children, interest of schools in 
hygiene instruction, and physical education. The effects of the child- 
centered idea in education were marked. Various sociological develop- 
ments in relation to environment, housing, social reform, and play- 
grounds affected school health education. Of greatest significance was 
the interest of the federal government in health education which re- 
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sulted in the White House Conference on Child Health and Protection 
and crystallized interest in child health. 


The school survey of 1928 pointed out the numerous school build- 
ing needs and paved the way for the improvement of environmental 
conditions pertaining to the health of the school child. However, the 
establishment of rules and regulations governing standardization of 
schools led to better playgrounds, improved water supply, and sanitary 
rural school toilets. 


There was teacher participation in medical inspection accompanied 
by an increase in the number of health supervisors. This resulted in 
better medical examinations. Boards of education were given authority 
to require all teachers to have certificates of good health.15 This recog- 
nition of the relationship of the health of the teacher to the total schooi 
program was of great importance. 


The most pronounced development, however, was shown in the 
preparation and publication of courses of study in health and physical 
education, and the effects of alcoholic drinks and narcotics on the human 
body. On the other hand, the abolition of the office of state director 
of physical education in 1932, due to lack of funds, became a handicap 
to school health education. This action was due to lack of funds charac- 
teristic of the depression. That such action was resented by the people 
is evidenced by the fact that its legality was questioned and the matter 
was taken to the Supreme Court. The latter ruled that the abolition of 
this office was unconstitutional.'® Despite this fact the office has not 
yet been restored 


During the summer of 1931, representatives of the President's 
Organization on Unemployment Relief, following a survey of the 
needs in West Virginia, approached the American Friends Service 
Committee of Philadelphia, and requested them to undertake relief 
work among the children of unemployed miners in the bituminous coal 
fields. After very careful consideration the committee decided to under- 
take this service.17 In September, 1931, feeding was begun by this 
committee. The peak load was: reached in April, 1932, when 33,227 
school children were furnished one substantial meal a day, and in addi- 
tion milk was furnished to 7,697 pre-school children. This project was 
further developed by the Works Project Administration. 


Recent Developments, 1940-45.—West Virginia schools have more 
recently given increased attention to health protection. Three funda- 
mental activities in this area are hygienic arrangement of the school 
program to provide healthful living, the physical education program, 

15 Supplement to the Code of the State of West Virginia, 1933, Chap. 18, Art. 1752. 


16 Biennial Report of the State Superintendent of Schools, 1930-32, p. 18. 
17 Report of the Child Relief Work in the Bituminous Coal Fields, 1932, p. 3. 
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and health instruction. Hygienic arrangement of the school program 
includes administrative planning of recesses, length of recitation periods, 
sequence of subjects, kinds of examinations, amount of homework 
allowed, number of children per class or per teacher, maintenance of 
social and wholesome atmosphere in the classroom, arrangement for 
lunches, and other activities in the school program that are related to 


the child’s health. 


Physical education programs have been expanded and emphasized. 
The High School Victory Corps Program promoted by the United 
States Office of Education and the redefining of the contribution of 
physical education to general education and to physical fitness gave 
impetus to this movement. 


More time is being devoted to health education classes. Often these 
classes or their materials are integrated or correlated with physical edu- 
cation, science, and home economics classes. Health in schools in modern 
times is not a subject but a manner of living, and the experiences pro- 
vided in the more recent teacher's manual lend themselves to this 
concept. 


There was a need for special health classes for children who could 
not successfully carry forward the regular school program because of 
poor physical conditions or some physical defect. A school for the deaf, 
dumb, .and blind was established at Romney in 1870,!8 and at Institute 
in 1926.19 No legal provision was made for other handicapped children 
until 1941.2° In that year funds were provided for an educational pro- 
gram for these children. Included in this program were classes for crip- 
pled children, speech improvement, and the mentally retarded. Funds 
have not been sufficient to include all children needing this kind of 
instruction. 


The modern school health education program recognizes significant 
contributions of the various divisions of the State Department of Edu- 
cation, State Health Department, county health department, state phys- 
ical education association, state nutrition committee, Department of 
Public Assistance, and more recently the W. K. Kellogg Foundation. 
These agencies and organizations in cooperation with the schools are 
responsible for school health education as it exists in West Virginia 
today. 


18 Code of the State of West Virginia, 1870, Art. 8, Sec. 1855. 
19 Biennial Report of the State Superintendent of Schools, 1924-26, p. 145. 
20 Ibid., 1942-4, p. 71. 
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Components of Variation and the Consis- 
tency of Repeated Measurements 


By Joser BrozEK and Howarp ALEXANDER 
Laboratory of Physiological Hygiene 
University of Minnesota 
Minneapolis, Minnesota 


(Submitted for publication, January, 1947) 


HE statistical technique referred to as “analysis of vari- 

ance” has been used in biological work primarily for 

testing significance of the differences between the means 
of two or more sets of measurements by the F ratio. In addi- 
tion, in a set of measurements repeated on a group of individ- 
uals, the technique can be used for estimating the magnitude 
of the components of variation and provides means for deter- 
mining the consistency with which a variable is measured in 
a series of repeated testing trials. 


The procedures and assumptions involved in the statistical 
analysis of the sources of variation have been discussed by 
Snedecor** and Crump.* The problem of reliability has been 
treated in detail by Alexander.' In this paper we shall present 
the fundamental concepts and the rationale of the methods, 
develop the formulae, and give the computational procedures. 
An attempt will be made to treat the subject in a way under- 
standable to a scientist who does not have a technical training 
in statistics. 


1. The concept of variance.—In describing a series of meas- 
urements one would like to know at least two general char- 
acteristics of the series: the mean value and the variability of 
the measurements. Variability is frequently expressed in terms 
of variance. The variance of a series containing n measure- 
ments is equivalent to the standard deviation squared and is 
defined by: 

sum of squared deviations from the mean 
sham degrees of freedom 





(1) 


The “degrees of freedom” in this case represent the number of 
measurements minus one. 

The degrees of freedom represent the minimum number 
of measurements required in order to reconstruct the series 





Mr. Alexander is at present at Adrian College, Adrian, Michigan. 
* Superior figures refer to numbered bibliography at end of article. In ref- 
erence No. 4 note particularly pages 228-261. 
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when the mean is given. If the mean of three measurements 
equals 5, then, knowing that two of the measurements are 4 
and 3, the third one can easily be calculated and is found to 
be 8. Thus in a series of n measurements the degrees of free- 
dom are (n—1). 

Defining y, a deviation of a measurement (Y) from the 
mean of the series (Y), as y=Y—Y, we can write 


nh, 

1 
V=— (la) 

n—1 

n 

p(Y—Y)? 
V—=—— (1b) 

n—1 


2. Score components in a two-way table—Two sets of 
means can be calculated for the data from experiments in 
which n individuals have been measured on k days: (1) The 
means for the individuals, each individual being considered 
over the period of k days, and (2) the means for the days, these 
daily means being based on n individuals. The arrangement 
is represented in Table I. 


TABLE I 


SYMBOLIC REPRESENTATION OF A TWO-WAY TABLE WITH 
k CoLUMNS (DAYS) AND n ROWS (INDIVIDUALS) 


Individual | Individual 





Totals | Totals 
Days k 
SY: Vi 
1 2—-— — k&k 1 
—— 
1 Y, 1 Y, ey Sere eo Yi, | Th Yu 
rs 
3 2 Yo, Yoo a Yo, Tie | Yu 
g wer 
Se | 
= | | 
& | 
n Yur Yue — => — Yu, Zim Yin 
n : a. Ce ee 
Daily Totals 2y,, Pn ag. eee T | 
l | 
‘ — = ——_—_—— ————__—— 
Daily Means Y» Yp1 Ypp — — — Yo | Y 


Y=measurement on any individual on any day. 
Y; ,=measurement on the first individual on the first day, etc. 
T,;,—sum of the k daily measurements on the first individual. 
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Tp,;=sum of n individual measurements on the first day. 

T=sum of all values on all days. 

Y=mean of all values on all days. 

In a two-way table a single measurement or a “score,” Y, 
obtained for a given individual on a given day, can be regarded 
as being composed of four parts: 


Wan +09) 44 ee — 2) +1 —-Y—(8,—Y)—(92—Y)] (2) 
A ae a” oe r (2a) 
where Y=a measurement 
nk 
yd 
Y=—mean of all n.k measurements, "4 
n.K 
k 
SYp 


Y:i—mean of k measurements on an individual, — 


n 
SYp 


Yp—mean of n measurements obtained on a given day, —— 
n 


The four terms on the right side of the equation (2a) rep- 
resent the following components of a measurement: 


Yisa component common to all n.k. measurements and 
indicates the overall level of the scores. Mathematically it is 
defined «is the general mean; 

i cheracterizes the average level of the measurements ob- 
tained on an individual and is defined as the deviation of the 
individu3l mean from the general mean, i—Y,—Y; 

d cheracterizes the average level of the measurements ob- 
tained on a given day and is defined as the deviation of the 
day’s mean from the general mean, d=Y)—Y; and 

r is the residual or random component which remains 
after we have accounted for the portions of the score contrib- 
uted by the general mean, the individual mean, and the daily 
mean, 

r=[Y—Y—(¥,—Y) —(¥p—Y) ] 
From equation 2a the following transposition can be made: 
Y-—Y=i+d+r (2b) 
y=i+d-+4r (2c) 
which indicates that the deviation of a score from the general 
mean (y=Y—Y) is a sum of the components i, d, and r. 

The scores in Table II, simulating a set of measurements 
on five individuals tested on four days, are analyzed into their 
components as indicated in Table III. Each score Y can be 
obtained as the sum of the components Y, i, d, and r. The 
differences, y, between the measurements Y (Table I) and the 
general mean Y (Table III 1) are given in Table IV. 
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TABLE II 


SET OF FICTITIOUS MEASUREMENTS, Y, 
On n INDIVIDUALS OVER k Days 





Days | 
1 2 3 4 T; 4 
l 30 32 28 30 120 ' = 
Indi- 2 19 32 35 30 116 29 
viduals 3 20 38 23 23 104 | 26 
4 33 36 29 38 136 | 34 
5 28 27 40 29 124 | 31 
Tp 130 165 155 150 600 aie 
Yo 26 33 a. 2 30 
TABLE III 
COMPONENTS OF THE MEASUREMENTS Y FROM TABLE II 
Y=Y+i+d+r 
1.Component Y 2.Componenti 3.Componentd 4.Componentr 
Days Days Days Days 
. 2 Bs a oa 2? 4 i: 2 ae 
nian /\—_—_, 
« 1/30 30 3030 00 00-4 3 10 4—1-—3 0 
= 2 30 30 30 30 —1—1—1—1 -—4 3 1 0-6 0 5 1 
oe 30 30 30 30 —4—4-—4—4 -—4 3 1 0-2 9—4-—3 
= 4] 30 30 3030 4 4 4 4-4 3 10 3-1-6 4 
5 5 30 30 30 30 1 1 1 bwit? $i @ 1—7 8—2 


TABLE IV 
DEVIATIONS OF SCORES IN TABLE II FROM THE GENERAL MEAN 
y—Y-—Y=—(i+d-++r) 
Days 
1 2 3 4 





mk | 0.42 -—2 0 
3 2 | —11 423 +5 0 
= 3 —10 +8 —7 —7 
a 4 +3 +6 -—1 +48 
a ae SS 


The sum of the squares of the n.k values of y in Table IV 
can be obtained by adding up the sums of i”, d?, and r?: 
ak nk, nk, nk, 
¥y2=F 12+2 d?2+2 r2 (3) 
644 — 136+ 1304378 
The “degrees of freedom” (nk—1), for the n.k values of y 
also can be partitioned into three quantities corresponding, re- 
spectively, to i, d, and r: 
nk—1=(n-—-1)+(k—1)+(n—1) (k—1) (4) 
3. Variances in a two-way table.—On the basis of the val- 
ues i, d, r, and y we can obtain in a two-way table four vari- 
ances, sometimes referred to as “mean squares”: 
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(1) A variance based on the differences between indi- 
viduals: 
nk 


T 
Vui-=—— (5) 
n—1 


L (2) A variance based on the differences between days: 


i2 


- nk 
> d2 
| Vy Pi. (6) 
d sey 
(3) A variance based on the random components of the 
scores: 


Bi nk 
oy » r2 


= 


(7) 


R 


Va=(a—1) (K—1) 

‘’ (4) A variance representing variation of the scores, irre- 
spective of their classification according to individuals or days, 
or a total variance in the table: 


Vi=——— (8) 


(8a) 





Vr 


~ (n—1)+(k—1) + (n—1) (k—1) 

The variance V, represents a derived term, and is of little 
interest as compared with the first three variances, which were 
obtained independently from each other. 







4. Simple computational formulae for variances in a two- 
way table-—The scheme indicated in Table V greatly simpli- 
fies computation of the terms V;;, Vip, and Vx. This scheme is 
to be used in conjunction with the data as set up in Tables II 
and III. 

TABLE V 


i SCHEME FOR OBTAINING VALUES NEEDED FOR COMPUTATION OF 
Be THE VARIANCES V»:, Vov, AND Va 














ae =(Ti)? 
1 oe 
N,=nk “ae k 
No = 
(To)? (T)2 
z3= bas nk 
a N,=1 
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where Y=any score 
k 
T,— > Y:—sum of the k scores for an individual 
1 


T=? Y,—sum of the n scores for a day 
1 
nk 
T=> Y=grand total 
1 
The numerator in the variance formulae is obtained by 
adding or subtracting the sums 3%), 3», 33, and 34, drawn from 
Table V; the denominator, by following the same pattern of 
additions or subtractions of the corresponding numbers, N,, 
No, Nz, and N,. The operations involved are as follows: 








le—Z 
Vui=—— 9 
, No_N, (9) 
Veesee—ee (10) 
N,—N, 
S.—~Ne-~ Bie 
Vine 2 3+24 (11) 


N, —N.—N3 +N, 


By substituting the terms represented by the symbols = 
and N in the above equations, it can be shown that they are 
identical with the analytical formulae (5), (6), (7), and (8). 

The data in Table II, obtained from random two-digit 
numbers between 19 and 40, yield values of = and N as given 
in Table VI. 


TABLE VI 


VALUES OF ¥ AND N, AND THE VARIANCES Vj), Vip, AND Vx 
FOUND FOR THE DATA FROM TABLE II 


72,544 














>, —18,644 2=— —18,136 
90,650 600) 2 PETTY 0 
x3 18,130 ah. ) —18,000 
Nz 4 | N,=1 
a ees 
18,136—16,000 136 ; 
om —_——-_—-—34.0 
5—1 4 
18,130—18,000 130 
bp —_—_—_—__—_——_—_-=43.3 
4—1 3 
18,644—18,136—18,130+18,000 378 
Vi= —_—— _— ———=31.5 
20—5—441 12 


Using the F ratios, we can test whether the variances be- 
tween individuals and days (V» and V»») are significantly 


larger than the random variance: 
Fyi=Voi/Ve (12) 
Fip=Vov/Ve (13) 
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Substituting the values from Table VI we have 
Fyi1=34.0/31.5=1.079 
Fyp—43.3/31.5=1.374 
The variances entering into the F tests are associated with 
the following degrees of freedom (d.f.): for Vi, d.f.—n-1=—4; 
for Vpp, d.f.—k-1=3; for Vp, d.f.—(n-1) (k-1)=12. Snedecor’s 
Table 10.7‘4. 222) gives the 5 percent level of significance 
for an F associated with 4 and 12 df. as 3.26; for 3 and 12 d., 
the 5 percent point of F is 3.49. Therefore, neither of the two F 
values obtained above reach the level of statistical significance. 
This means that in Table II there is only one important source 
of variation, the random fluctuation of the scores. 


5. Composite total variance.—The terms V», Vpp, and Vr, 
derived from a given sample of n individuals tested on k days, 
may be used for obtaining an estimated total variance in a 
population of an infinite number of such individuals measured 
over an infinite number of days. This composite estimate of 
“population variance” can be expressed in terms of its com- 
ponents as follows: 

o2=0;2+op7+ 072 (14) 
where c2=composite estimate of the total variance, 
o,;2—component due to the differences between the test values ob- 
tained by different individuals 
op2=component resulting from differences between test values ob- 
tained on different days (trials), and 


ox2=—component representing the random fluctuation of the re- 
peated measurements. 


The components cannot be determined directly and must 
be estimated on the basis of the given sample or samples 
studied. The estimate of one of the above terms can be 
obtained as : 

ox2=Ve (15) 
Vp is computed as indicated in formulae 7 and 11. 


The values o;? and op”? are obtained from the terms V), 
and Vp, to which they are related as follows: 
Vir=kK +o12+0x2 (16) Vin=N eop?+or?2 (17) 
where k=the number of days and n=—the number of indi- 
viduals. 


These equations lead to: 
> Varn Ve 


go; += 


(18) ot (19) 
k oa 

The values o;”, op”, and op” indicate the estimated magni- 
tude of the individual, daily, and random component of the 
total variance. 
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For the data of Table II we obtain the following values: 








On2-—=Ver—31.5 
Vori—Vr 34.0—31.5 
o17*= — —0.6 
k 4 
Veo—Vr 43.3—31.5 
op*= 7 = —2.4 
n 5 


The composite estimate of total variance o?, according to 
formula (14), is 


o2—0.642.4431.5—34.5 


Random component contributes the largest part, 91.3 per- 
cent; the individual component, 1.7 percent; and the daily 
component, 7.0 percent of the total. 


6. Average intra-individual and inter-individual variance. 
In characterizing a given set of n individuals studied over a 
period of k days, we can use advantageously an average intra- 
individual variance (i.e., variance between daily measurements 
within individuals, Vy) and an average inter-individual vari- 
ance (i.e., variance between individual measurements within 
days, Vwn) 








k — 
n + (Y,—-Y;) 2 
i: na. 
Vu a (20) 
n 
n —_— 
k > (Yo-Yo)? 
Vv 1 at (21) 
wD— k 


The term V,, indicates the average magnitude of day-to-day 
variation in the scores; Vy» represents the average amount of 
individual - to - individual variation for the given sample of 
subjects. 


Using symbols defined in Section 4, the simple computa- 
tional formula for V,; and Vywp are: 


Rs se 22 
ee (22) 
21—X3 
Vesx 23 
=e (23) 


Using the values given in Table VI: 
18,644—18,136 508 








Vwui= ——=33.9 
20—5 15 
18,664—18,130 514 

Vwp=- - ———_=32.1 
20—4 16 


The variances V,; and Vyp can be thought of as the sum 
of two components: 








Os nar nt ee ER 


= 
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Vwiop2+or2 (24) 
Vwp=012+or2 (25) 
Because of their simplicity, these equations may be used with 
profit for obtaining the values o,* and o;*, two of the estimated 
components of the composite total variance: 
op2=Vwi-—or?27=Vwi— Var (26) 
012 =Vwp—or2=Vyv— Ver (27) 

Equations (26) and (27) facilitate a direct, intuitive under- 
standing of the terms oy,” and o,*: they represent the amount 
of variation arising from the differences between daily scores 
and individual scores, respectively, after the random variation 
in the scores has been removed. 


Using the figures for a given sample we obtain: 


on?2=33.9—31.5=—2.4 
o;2=32.1—31.5—0.6 


The values check with those obtained by formulae (18) 
and (19). 


7. Criteria of consistency of repeated measurements based 
on random variance.—The random variance, Vx, is a measure 
of inconsistency in the measurements repeated on separate 
days (or trials). The random component, r, of each score is 
the discrepancy between the actual score, Y, and the predicted 
score, Y 


r—Y-—Y (28) 


where Y is obtained as a sum of the grand mean, Y, the devia- 
tion of an individual mean (Y;) from the grand mean, and the 
deviation of a day’s mean (Y,) from the grand mean: 


Y=T4(7,-Y)4(%.-Y) (29) 


The larger the discrepancies between the expected (Y) 
and the actual (Y) test scores, the greater the fluctuations in 
the individual’s standing with respect to the performance of 
the group. This lack of consistency is reflected in the magni- 
tude of the random variance, V,. As indicated in equation (7), 


nk. divided 


Vr is obtained as a sum of the r values squared, . 


by the degrees of freedom, (N-1) (k-1). 


The random variance, V, (and its square root, S.D.p, the 
standard deviation of the random components of the test scores 
or the “standard random error” of the scores) is expressed in 
terms of the absolute units of measurement used in a particu- 
lar test, such as kilograms of weight lifted, time of a run in 
seconds, number of taps in a tapping test, etc. The values Vp 
or S.D.,, obtained for different tests, are not directly compar- 





ee 











ee 
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able. Under certain conditions we may compare the con- 
sistency of performance in different tests by expressing the 
S.D., as the percentage of the grand mean. 


It is much more satisfactory to evaluate the magnitude of 
the random variation in reference to the variability of the 
scores obtained by the different individuals. The best estimate 
of the amount of inter-individual variation is the value Vyp, 
the average inter-individual variance or the variance within 
days, defined in the equation (21). 


The comparison may take a form of a simple percentage 
ratio: 











Vi 
Vr%= x 100 (30) 
wb 
or of a coefficient of consistency, rc: 
V 
ee (31) 
wb 
This can also be written as 
Vwo—Ve 
'o=" (31a) 
Vwp 


When V,=O, i.e., when there is no random variation in the 
scores, the value of the coefficient of consistency re=1; this 
means that the scores obtained by different individuals on dif- 
ferent days are identical or differ from day to day by a factor 
which is the same for all individuals. The larger the Va, at 
the same Vp, the closer the re approaches zero. When Vp= 
Vwp, To—0. 


This consistency coefficient is determined, as indicated in 
formula (30), by two factors: (1) the amount of random vari- 
ation, measured as Vp, and (2) the amount of inter-individual 
variation, expressed as Vywp. It may be helpful to repeat that 
Vwp is the variance between the scores obtained by the indi- 
viduals comprising the sample, averaged for the k days on 
which the measurements were repeated. In laboratory investi- 
gations of fitness, the subjects usually represent a very homo- 
geneous group with a narrow spread in age, health, perform- 
ance capacity, etc. The greater the homogeneity, i.e., the 
smaller the differences between individuals comprising the 
sample studied, the lower—for the same magnitude of random 
variation in the scores—will be the coefficient of consistency, rc. 


To evaluate correctly the magnitude of the random error 
we should know the “true” variance of the individual scores in 
the population. No amount of statistical gymnastics can pro- 
vide this datum. The information can be obtained only by 
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testing a number of individuals large enough to yield a truly 
representative sample of the population. When we are con- 
cerned with tests in which training does not affect the inter- 
individual variance in successive trials, the task is well within 
the reach of realization. Where the inter-individual variance 
does change with practice, the obstacles to obtaining an inter- 
individual variance for the population are discouraging. 


The consistency coefficient, r-, can be expressed in terms 
of the estimated components of the total composite variance: 
o12 


SS 
o1-+Cr- 


(32) 

It is apparent that the daily component, op”, of the total 
composite variance does not enter in; the value of r¢ is not 
affected by the variations in the daily (trial) means and can, 
therefore, be used for evaluation of the consistency of repeated 
measurements in which a trend is present. 


The rg yields for a series of repeated trials a consistency 
coefficient comparable to the ordinary product-moment coeffi- 
cient of correlation, used when only two test trials are avail- 
able. If we assume that the inter-individual variances in suc- 
cessive testing trials are homogeneous, then the rg represents 
the average of the product-moment correlation coefficients 
computed for the k testing trials, taken two at a time; there are 
k(k-1) 

2 
product-moment correlation. 





such pairs, yielding an equal number of coefficients of 


The formula for rg can be written in a form which brings 
out the relationship of rq to the coefficient of intra-class corre- 
lation: »- 245) 

Voi—Vrk 
re Vert (k—1) Vi 
where k=number of trials (days) and V,,,—variance “between 
individuals,” defined in equation (5). 





(33) 


Using the values obtained in our fictitious sample of 5 
individuals measured on 4 days we obtain: 








Vr 31.5 
ro=1— —1————=1—0.981—0.019 
Vwn 32.1 
o;2 0.6 0.6 
re = ———0.019 
o,*+or2 0.6+-31.5 32.1 
Voi—Vr 34.0—31.5 2.5 


re=- — 


=——— = ——- ~~ _ 9,019 
Virt+(kK—1) Ve 34.04(4—1) 31.5 131.0 
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8. Statistical significance of a coefficient of consistency, 
ro.—The statistical significance of a given coefficient of con- 
sistency is obtained in the form of an F test: 


Pasa (34) 
‘C— Vi 
The values for data in Table II yield 
34.0 
o= aT =1.079 
31.5 


associated with 4 and 12 degrees of freedom. The correspond- 
ing 5 per cent point for the F test is 3.26. Our Fr¢ is far smaller 
and cannot be considered statistically significant. 


9. Statistical significance of a difference between two val- 
ues of re.—In comparing the consistency of two different func- 
tions, and of the same function measured under different con- 
ditions or with different techniques, we want to know whether 
the difference between the two values of rg is statistically sig- 
nificant. The procedure for making such comparisons will be 
essentially identical with that provided by Fisher. ‘+: 2%4*) 

The steps are (1) to determine the range in which the 
values of re could be expected to fall due simply to random 
sampling, and (2) to examine whether or not the ranges cor- 
responding to the given two values of re overlap. Overlap of 
the ranges would indicate that the two values could be thought 
of as characteristics of two samples drawn from the same 
population, ie., the difference would not be considered as 
statistically significant. 

10. Practical significance of a given re.—No definite limits 
have been established for evaluating the practical significance 
of the consistency coefficient. The consistency demanded from 
a test will depend in part on the purpose for which the data 
will be used, and useful generalizations will emerge only from 
the actual study of the consistency of various functions meas- 
ured in connection with experimental studies on human fitness. 

Provided that the group of subjects under study is reason- 
ably heterogeneous with respect to its fitness characteristics, 
tentative evaluative limits may be set as follows: 0.9l=rce=1.00 
would indicate a very high consistency of the repeated meas- 
urements, 0.8l=r,-=0.90 high consistency, 0.71=r~e=0.80 mod- 
erately high consistency, and r~e=0.70 low consistency. 


11. Examples.—In order to make the presentation more 
concrete, two sets of actual data will be analyzed in terms of 
the methods described in the preceding sections. The data 
included in Table VII represent test scores obtained by the 
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mental regimen were given elsewhere.” 


TABLE VII 


Set oF ACTUAL DATA OBTAINED ON EIGHT SUBJECTS MEASURED IN 


PEATED TESTING SESSIONS 


A. Tapping Speed, total number of taps in 10 seconds 


Days of the Experiment 


Subjects 45 78 117 141 T, 
G 62 64 65 65 256 
W, 74 72 69 72 287 
Ww, 79 80 75 75 309 
T 75 75 67 70 287 
N 72 76 72 75 295 
Ss 73 74 71 71 289 
Jo 67 72 72 75 286 
Ja 67 77 69 66 279 
Tp 569 590 560 569 2288 
8 

7(Yp) 2 40,677 43,670 39,270 40,581 

8 

z(t) 2— 655,938 

4 

#(To) 2—1,309,222 

T? —5,234,944 


B. Coordination, number of errors 


Days of the Experiment 


Subjects 45 78 117 141 T: 
G 33 33 22 17 105 
W, 26 29 27 21 103 
W, 16 16 12 11 55 
st 35 25 27 25 112 
N 35 29 30 20 114 
Ss 37 31 26 16 110 
Jo 43 30 25 24 122 
Ja 52 40 43 26 161 
Tp 277 233 212 160 882 
8 

¢(Yv) 2 10,393 7,113 6,136 3,384 

8 

Ss 


5(Tp)2= 201,562 


1 
4 
>(T:)2= 103,104 
T2 = 177,924 


+ 
DY? 
1 


16,390 
20,605 
23,891 
20,639 
21,769 
20,887 
20,482 
19,535 


164,198 


as RA 
“ 
te 


2,951 
2,687 

777 
3,204 
3,366 
3,262 
3,950 
6,829 


27,026 





individual subjects in a two-plate tapping test and in a test of 
coordination. The details of testing methods and the experi- 
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Table VII contains both the raw data and the derived 
totals. The = and N values needed for the computation of the 
sample variances and the derived values are given in Table 
Vill. 


TABLE VIII 


COMPUTATION OF SAMPLE VARIANCES, ESTIMATED VARIANCE COMPONENTS, 
F-TrEsts OF SIGNIFICANCE OF THE DIFFERENCES BETWEEN INDIVIDUALS 
AND BETWEEN DAYS, AND COEFFICIENTS OF CONSISTENCY 





A B 
Tapping Speed Coordination 
“3, =164,198 | Y.—163,984 ~~ 3,=27,026 |  %.=—25,776 
N,=32 No.8 : N, =—32 a N2,=8 we 
Y3 —163,653 D4=—163,592 5 Y3=25,195 D4=—24,310 
Ng — | _ Ny =! Far p Nz =—4 rs Ny= a 

Vn1=392/7 = 56.00 Vii=1,466/7 =209.43 
Ven= 61/3 = 20.33 Viv= 885/3 =295.00 
Vr=153/21 = 17.29 Ve= 365/21 = 17.38 
V wi=214/24 = 8.92 Vw1=1,250/24 = 52.08 
V wv 545/28 = 19.46 Vwn=1,831/28 = 65.39 
o:2= 12.17 57.7% oi2= 48.01 48.0% 
op 2: 1.63 7.7% ov2= 34.70 34.7% 
or 7.29 34.6% on2= 17.38 17.3% 
o2= 21.09 100 % 2—100.09 100 % 
Fi:= 56.00/7.29— 7.68 F,,=209.43/17.38= 12.05 
Fy, 20.33/7.29— 2.78 Fy »=295.00/17.38=— 16.97 

r 0.625 re= 0.734 


In the tapping test we have only one source of variation 
which is significantly greater than the random error, namely 
the differences between individuals. The coefficient of correla- 
tion appears to be low, even though it is statistically significant. 


The low value is due primarily to the extreme homogen- 
eity of the individuals comprising the sample. The standard 
random error of the repeated measurements is small (S.D.p= 
VVr=2.7), representing only 3.8 percent of the mean score 
(Y=71.5). 


The scores in the cordination test exhibit a definite trend 
toward lower values as the time goes on; both “days” and 
“individuals” are significant sources of variation. The coeffi- 
cient of correlation is somewhat higher than in the tapping 
test. 


12. Criteria of consistency of repeated measurements based 
on the variance within individuals.—The variance within in- 
dividuals, V,;, and its square root, S.D.,;, the average standard 
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deviation of daily measurements about the individual means, 
indicate the amount of day-to-day variation in the scores. 

It has been shown (formula 24) that in terms of the esti- 
mated components of the population variance, the variance 
within individuals, V,;, can be expressed as a sum of the daily 
component, op”, and the random component, ox”: 

Vwiron?+or? 

In the absence of significant differences between the daily 
means, i.e., when op? is negligible, the V,,; can be used in place 
of the random variance, Vx. 


Using Vwi, the coefficient of day-to-day consistency (r’) of 
repeated measurements may be expressed in the form of intra- 
class correlation: 

A Vor —V.« I 
"Vert (ie—1) Vor 
For details refer to reference No. 1. 


(37) 
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Park Recreation: No report. 

Private Recreation: Harry D. Edgren. 
School Recreation: Ruth Abernathy. 


General Sections 
City Directors: Pattric Ruth O'Keefe. 
Measurement and Evaluation: Esther 
French. 
Professional and 
Howard Danford 
Professional Education: Ralph Piper. 
Research: Thomas K. Cureton. 
Student: Nancy Neville. 
Unofficial 
Aquatic: Jack Tourney. 
Elementary Physical Education: Ed- 
wina Jones. 


Public Relations: 


District Association Presidents 
Central: Elizabeth Graybeal. 
Eastern: Clifford L. Brownell. 
Midwest: Leon G. Kranz. 
Northwest: Dorothea Lensch. 
Southern: Joy W. Kistler. 
Southwest: Verne Landreth. 

State Representatives 
(Central District) 
Colorado (2): Maisie Hoyt; Willard 
N. Greim. 


Iowa (2): Theresa Anderson; Roy 
Moore. 

Kansas (2): L. P. Washburn; Edna 
McCullough. 


Minnesota (2): P. J. Sandell; Mabel 
J. Shirley. 

Missouri (3): Muriel M. Lomax; 
Helen Fahey; Donald Ingerson. 

Nebraska (1): Ed Higginbotham. 

North Dakota (1): E. J. Cassell. 

South Dakota (1): Irv Nelson. 

Wyoming (0): None. 

(Eastern District) 

Connecticut (2): Joseph V. 
Josephine Rogers. 

Delaware (1): George W. Ayars. 

District of Columbia (2): Julian N. 
Colangelo; Virginia Dennis. 

Maine (1): Howard G. Richardson. 

Maryland (1): Thomas C. Ferguson. 


Burns; 








Massachusetts (3): Robert Berry; 
Lawrence E. Briggs; Daniel Kelly. 

New Hampshire (0): None. 

New Jersey (3): Margaret Millar; 
Herbert A. Stine, Margaret C. 
Brown. 

New York (5): Ellis Champlin; Ethel 
Kloberg; Frank S. Lloyd; Francis 
J. Moench; Carroll H. Smith. 

Pennsylvania (4): C. H. Schuler; 
William A. Palmer; Wilfred C. 
Collins; E. H. Nelson. 

Rhode Island (1): Martha Jones. 

Vermont (1): Howard Douglas. 

(Midwest District) 

Illinois (5): Ray Duncan; Ruth Lins; 
Norman Ziebell; Bess Specht; Ross 
Anderson. ° 


Indiana (2) : Carolyn W. Bookwalter ; 
Mark C. Wakefield. 


Michigan (2): Dorothy J. Parker; 
Augusta Harris. 
Ohio (4): Paul Landis; Grace B. 


Daviess ; Edwina Jones; George Ko- 
zak. 

West Virginia (2): Patrick A. Tork; 
Sara Cree. 

Wisconsin (2): George Wolf; Clif- 
ford Fagan. 

(Northwest District) 

Idaho (0): None. 

Montana (1): Charles F. Hertler. 

Oregon (2): Ray Hendrickson; Dor- 
otha M. Massey. 

Washington (2): 
Jesse Puckett. 


Kathro Kidwell; 


(Southern District) 


Alabama (2): Louise E. Levy; Jessie 
R. Garrison. 
Arkansas (1): Paul McFarran. 
Florida (1): Grace Fox. 
Georgia (2): Thomas E. 
ough; Gertrude Manchester. 
Kentucky (1): Elizabeth Ferver. 
Louisiana (2): J. W. Kistler; Simon 
A. McNeely. 


McDon- 
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Mississippi (1): Russell Lyons. 

North Carolina (2): Charles E. Spen- 
cer; Pete Mullis. 

Oklahoma (1): Earl Sullens. 

South Carolina (1): Lucille Bowers. 

Tennessee (1): Emmett Strickland. 

Texas (2): J. S. Nicoll; Bonnie Cot- 
teral. 

Virginia (2): Eliot V. Graves; Kirk 
Montague. 


(Southwest District) 
Arizona (1): John L. Barringer. 
California (4): Anna Espenschade; 
Howard Bell; Verne Landreth; 
Genevie Dexter. 
Nevada (0): None. 
New Mexico (1): Caskey Settle. 
Utah (1): Sheilah Woodland. 


Affiliated Organizations 


American Academy of Physical Edu- 
cation: C, H. MeCloy. 

American Physiotherapy Association: 
Ellen Kelly. 

American School Health Association: 
C. H. Keene. 

Boys’ Clubs of America, Incorporated: 
Walter M. Hall. 

Canadian Physical Education Associa- 
tion: A. S. Lamb. 

College Physical Education 
tion: Glenn W. Howard. 

Delta Psi Kappa: Eurice Miller. 

National Association of Physical Edu 
cation for College Women: Gertrude 
Manchester. 

National Collegiate Athletic Associa- 
tion: Frederick W. Luehring. 

Phi Delta Pi: Bernice Moss. 

Phi Epsilon Kappa: Carl F. Trieb. 

Physical Education Society of the 
YMCA’s of North America: Mar- 
shall L. Walters. 

Society of State 
Spencer. 


YWCA Health 


Associa- 


Directors: Charles 


Education Directors’ 


Society : Grace M. Palmer. 











